Accepted for publication in the Astrophysical Journal; 2008 June 26 

Preprint typeset using 1^1^^ style cmulatcapj v. 10/09/06 



00 

O 
O 
(N 

(N 

6 



SPITZER 'S CONTRIBUTION TO THE AGN POPULATION 
J. L. Donley, ^ G. H. Rieke, ^ P. G. Perez-Gonzalez, G. Barro^ 

Accepted for publication in the Astrophysical Journal; 2008 June 26 

ABSTRACT 

Infrared selection is a potentially powerful way to identify heavily obscured AGN missed in even 
the deepest X-ray surveys. Using a 24 /j,m-selected sample in GOODS-S, we test the reliability and 
completeness of three infrared AGN selection methods: (1) IRAC color-color selection, (2) IRAC 
power-law selection, and (3) IR-excess selection; we also evaluate a number of infrared excess ap- 
proaches. We find that the vast majority of non-power-law IRAC color-selected AGN candidates in 
GOODS-S have colors consistent with those of star-forming galaxies. Contamination by star-forming 
galaxies is most prevalent at low 24 /j,m flux densities 100 /^Jy) and high redshifts (z ~ 2), but the 
fraction of potential contaminants is still high 50%) at 500 /iJy, the highest flux density probed 
reliably by our survey. AGN candidates selected via a simple, physically-motivated power-law crite- 
rion (PLCs), however, appear to be reliable. We confirm that the infrared excess methods successfully 
identify a number of AGN, but we also find that such samples may be significantly contaminated by 
star-forming galaxies. Adding only the secure 5piteer-selected PLC, color-selected, IR-excess, and 
radio/IR-selected AGN candidates to the deepest X-ray-selected AGN samples directly increases the 
number of known X-ray AGN (84) by 54 — 77%, and implies an increase to the number of 24 /xm- 
detected AGN of 71 — 94%. Finally, we show that the fraction of MIR sources dominated by an AGN 
decreases with decreasing MIR flux density, but only down to /24 ^m — 300 /iJy. Below this limit, 
the AGN fraction levels out, indicating that a non-negligible fraction (~ 10%) of faint 24 ^m sources 
(the majority of which are missed in the X-ray) are powered not by star formation, but by the central 
engine. The fraction of all AGN (regardless of their MIR properties) exceeds 15% at all 24 /im flux 
densities. 

Subject headings: galaxies: active — infrared: galaxies — X-rays: galaxies 
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1. INTRODUCTION 

Identifying complete and reliable samples of AGN has 
become a necessity for extragalactic surveys, whether the 
goal be the selection of AGN candidates or the removal 
of AGN "contaminants" . Only when armed with com- 
plete samples of AGN will we be able to determine the 
role of obscured accretion in the build-up of the present 
day black hole mass function, or accurately character- 
ize the star-formation history of the universe. Complete 
AGN samples are also required to test proposed evolu- 
tionary theories in which black hole formation and star- 
formation are intimately linked by merger and feedback 
processes (e.g. Hopkins et al. 2006), ultimately produc- 
ing the correlation between black hole mass and bulge 
velocity dispersion (Ferrarese & Merritt 2000; Gebhardt 
et al. 2000). Unfortunately, the varied luminosities, ac- 
cretion rates, orientations, and intrinsic obscurations of 
AGN prevent any one selection technique from reliably 
identifying all of them. For instance, while current UV, 
optical, and X-ray surveys are capable of detecting unob- 
scured AGN, they miss many of the obscured AGN and 
nearly all of the Compton-thick AGN thought to domi- 
nate AGN number counts at both low and high redshift 
(e.g. Gilh et al. 2007; Daddi et al. 2007a,b). Like- 
wise, only 10-15% of AGN are radio-loud, making radio 
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surveys relatively incomplete. 

The Multiband Imaging Photometer (MIPS; Rieke et 
al. 2004) and Infrared Array Camera (IRAC; Fazio et al. 
2004) instruments aboard Spitzer have provided sensi- 
tive surveys in multiple mid-IR bands. Infrared selection 
with MIPS and IRAC data is being used widely to select 
AGN candidates independently of their optical and/or X- 
ray properties. In addition to identifying AGN in fields 
with little or no X-ray data, infrared selection criteria are 
capable of identifying heavily obscured AGN missed in 
even the deepest X-ray fields (e.g. Donley et al. 2007). 
As such, IR selection has the potential to complement 
traditional AGN selection methods and to yield a more 
complete census of AGN activity. 

In this paper, we critically review the following in- 
frared selection criteria: (1) IRAC color-color selection, 
(2) IRAC power-law selection, and (3) IR-excess selec- 
tion. The first selection method employs color cuts in 
two representations of IRAC 4-band mid-infrared (MIR) 
color-color space (Lacy et al. 2004; Stern et al. 2005), 
the second identifies AGN whose IRAC SEDs are well-fit 
by a power-law (Alonso-Herrero et al. 2006, Donley et 
al. 2007), and the third selects red galaxies with large in- 
frared to UV/optical flux ratios (Daddi et al. 2007a, Dey 
et al. 2008, Fiore et al. 2008, PoUetta et al. 2008). The 
first two criteria are based on the same principle: the hot 
dust near an AGN's central engine reprocesses absorbed 
UV, optical, and X-ray emission into short-wavelength 
MIR emission, filling in the gap between the stellar emis- 
sion that peaks near 1.5 fim and the long- wavelength dust 
emission features that dominate the SEDs of star-forming 
galaxies. The color-color and power-law selection crite- 
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ria, however, differ in the range of mid-IR characteris- 
tics they include as possible AGN indicators. The third 
selection method identifies sources in which heavy ob- 
scuration with reemission in the infrared diminishes the 
optical emission and/or enhances the infrared emission. 

This paper utilizes improved spectral templates with 
a sample of infrared color-selected, power-law galax- 
ies (PLGs), and IR-excess galaxies in the ultra-deep 
GOODS-S field to test the reliability and completeness 
of these selection techniques over a wide range of sam- 
ple properties. Prom this analysis, we then quantify the 
contribution of these approaches plus Spitzer identifica- 
tion of radio-intermediate and radio-loud AGN to the 
X-ray-selected AGN population. The paper is organized 
as follows. In §2, we describe the selection of the color- 
selected, PLG, and IR-excess samples. The construction 
of high-reliability photometric redshifts is described in 
§3, as are the overall redshift properties of the sample. In 
§4, we briefly discuss the X-ray properties of our MIPS- 
selected sample. The infrared color selection criteria of 
Lacy et al. (2004) and Stern et al. (2005) are discussed in 
§5, where we compare and contrast the two selection cri- 
teria, investigate the behavior in color space of the star- 
forming templates, determine the redshift and flux de- 
pendencies of the color selection techniques, and investi- 
gate the properties of the most secure color-selected AGN 
candidates. In §6, we discuss the PLG selection criteria, 
and in §7 we investigate the IR-excess sources. Finally, in 
§8, we discuss the overall statistics of AGN revealed by 
IR-based methods compared with X-ray-selected sam- 
ples. A summary is given in §9. Throughout the paper, 
wc assume the following cosmology: (nrn,f^Aj -f^o)=(0.3, 
0.7, 72 km s-i Mpc-i). 

2. SAMPLE SELECTION 
2.1. Multi-wavelength Data 

We take as our initial GOODS-S sample all MIPS 

sources detected at 24 fim to a flux density of /24 > 
80.0 /xJy. At this flux limit, 99% of the MIPS sam- 
ple are detected to > IOct. There are several advan- 
tages to choosing a flux-limited MIPS sample. First, 
AGN (and LIRGS/ULIRGS) tend to be bright at 24 /im 
(e.g. Rigby et al. 2004). Selecting only those galax- 
ies with /24 ^m > 80.0 /uJy therefore retains all but the 
faintest AGN whfle excluding 50-60% of IRAC-selected 
IR- normal star-forming galaxies at all flux densities. Sec- 
ond, a MIPS flux-limited sample is not subject to the 
complicated slope-dependent selection bias prcscint in 
IRAC selected samples due to the significant variation 
in the sensitivity of the 4 IRAC bands (see Donley et 
al. 2007). Third, this selection gives a complete and 
well-defined sample of objects with extreme red R — [24] 
colors. 

The MIPS 24 fj,m catalog of the GOODS Legacy team 
(Dickinson et al., in prep.) is comprised of 948 sources in 
the MIPS CDF-S Legacy field with > 80.0 fiJy. While 
this MIPS depth can be obtained over the full CDF-S, 
we chose to limit this study to the GOODS region to 
take advantage of the super-deep IRAC imaging. The 
relative depths of the limiting MIPS flux and the super- 
deep IRAC photometry ensure that essentially all MIPS 
sources have high S/N IRAC SEDs, allowing us to study 
in an unbiased way the IRAC properties of this flux- 



limited MIPS sample. To ensure that all AGN candidates 
have sufficient X-ray coverage, we also required X-ray 
coverage of Tx > 250 ks from the deep 1 Ms CDF-S 
X-ray dataset (see Giacconi et al. (2002) and Alexander 
et al. (2003)). Despite this relatively low cut, 96% of 
the final MIPS sample have Tx > 0.5 Ms and 80% have 
Tx > 0.75 Ms. The resulting sample was drawn from an 
area of 195.3 sq. arcmin and contains 846 MIPS sources. 

The CDF-S is one of the best-imaged fields in the 
UV, optical, NIR, and X-ray. We took advantage of 
this extensive multiwavelcngth dataset by creating an 
aperture-matched catalog using the UV-NIR photome- 
try of Marzke et al. (1999, BJz), Vandamc et al. (2001, 
JK), Arnouts ct al. (2002, UUpBVRI), C0MB017 
(Wolf et al. 2004), Giavalisco et al. (2004, bvizJHK), 
Le Fevre et al. (2004, /), and GALEX (FUV, NUV) (see 
Perez- Gonzalez et al. 2005 and the UCM Extragalactic 
Database^ for more details). We then removed from our 
sample the 89 MIPS sources that had multiple optical 
counterparts that were (1) within a 2.5" search radius of 
the MIPS source and (2) separated by more than 0.5". 
(At r < 0.5", it is difficult to distinguish between miflti- 
ple counterparts and an extended/irregular source with 
multiple components.) While we do not restrict our sam- 
ple to regions covered by GOODS ACS imaging, 83% of 
the MIPS sources in our sample have deep ACS coverage. 

SExtractor-selectcd IRAC sources were similarly 
matched to the MIPS sample after combining the super- 
deep IRAC data with data from the deep, wide-area, 
Spitzer Legacy Program (PI: van Dokkum) and the 
MIPS-GTO IRAC program (see Perez-Gonzalez et al. 
2008 for further details). To ensure accurate MIR SEDs, 
we removed from the IR PLG and color-selected samples 
32 sources with blended IRAC or MIPS photometry. Of 
the remaining 725 MIPS sources that meet our criteria, 
713 (98%) have unique IRAC counterparts within a 2" 
search radius, and 699 (96%) have > 5(t IRAC detec- 
tions in all 4 IRAC bands, allowing us to determine ac- 
curately the MIR colors of essentially all members of the 
MlPS-selected sample. Of the 12 sources without IRAC 
counterparts, 6 were not detected due to blending with 
a nearby source, 4 had badly centered MIPS positions, 
and 2 had faint IRAC counterparts that fall below our 
catalog limit. 

2.2. Power-law, color- selected, and IR-normal samples 

After assigning IRAC counterparts, we separated the 
MIPS sample into three subsets: IR PLGs, IR color- 
selected galaxies, and IR-normal galaxies. We defined as 
PLGs sources whose 4-band IRAC photometry is well-fit 
by a line of slope a < —0.5, where fv oc (Alonso- 
Herrero et al. 2006; Donley et al. 2007). The effect of 
different cuts in a will be discussed in §6. To ensure a 
good fit, we required the chi-squared probability Px (the 
probability that the fit would yield a chi-squared greater 
than or equal to the observed chi-squared) to exceed 0.1. 
Px tends either to lie close to 0.5 (the probability that 
corresponds to a reduced chi-squared of 1) or is very small 
(see Bevington & Robinson 2003). This selection, which 
identified 55 PLGs, was done using the linfit task in 
IDL. This task takes the following 4-band input from 
IRAC: 

http://t-rex.fis.ucm.es/ pgperez/Proyectos/databaseuse.en.html 
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X = log(i/) (1) 

y = iog(f.) (2) 

Ay = [l/ln(10)] * AU/U (3) 

and returns the best fit slope, a, and the chi-squared 
probability, Px- While sources selected via this method 
arc dominated by the AGN in the mid-IR, wc do not re- 
quire that the power-law extend into the optical. Conse- 
quently, many infrared-selected PLGs are dominated by 
stellar emission at wavelengths short of ~ 2 yum, where 
the reprocessed emission from hot dust is suppressed be- 
cause of dust sublimation. 

Color-selected galaxies were defined as sources that 
meet the AGN IRAC color-cuts of Lacy ct al. (2004) 
or Stern et al. (2005), but that do not meet the PLG 
criterion. As discussed in Alonso-Herrero et al. (2006) 
and Donley et al. (2007), both the PLG and IRAC 
color cuts attempt to select luminous AGN that out- 
shine their host galaxies in the infrared, filling in the dip 
in a galaxy's SED between the short-wavelength stellar 
emission feature and the long-wavelength dust emission 
features. As such, the IRAC ACN color selection regions 
contain, but also extend beyond, the power-law locus 
in color space. While nearly all PLGs meet the IRAC 
AGN color selection criteria, not all color-selected galax- 
ies meet the PLG criteria. We therefore separate color- 
selected sources that can be identified via a power-law fit 
(PLGs) from those that can not (color-selected galaxies). 
The color-selected sample consists of 210 sources, 188 of 
which meet the Lacy et al. criteria, 72 of which meet the 
Stern et al. criteria, and 50 of which meet both criteria. 

Finally, we define IR-normal galaxies as sources that 
meet neither the IRAC PLG nor the color selection crite- 
ria; these sources comprise the remaining 448 galaxies in 
the MIPS sample. As they are not the focus of this study, 
we have not checked their IRAC and MIPS photometry 
by eye. Instead, we estimate that the fraction of IR- 
normal galaxies with blended IRAC or MIPS photometry 
is similar to that found for the PLG and color-selected 
samples, 11%, or ~ 50 galaxies. We caution that IR- 
normal does not mean purely star-forming. Instead, 'IR- 
normal' only indicates that any mid-IR emission from an 
AGN is overwhelmed by emission from the host-galaxy. 
In fact, many Type 2 and Seyfert- luminosity AGN meet 
the IR-normal criteria (Stern et al. 2005, Donley et al. 
2007, Cardamone et al. 2008). 

2.3. IR-Excess Galaxies 

In addition to dividing the MlPS-selected sample into 
the 3 sub-samples discussed above, we identified IR- 
excess galaxies using the criteria of Daddi et al. (2007a) , 
Dey et al. (2008), Fiore et al. (2008), and PoUetta ct al. 
(2008). (We also searched for bright ULIRGS that met 
the IR-excess criteria of Yan et al. (2007), but found 
none in our faint sample.) Daddi et al. (2007a) se- 
lected galaxies in GOODS (with a 3cr 24 /zm flux limit 
of 15-30 /xJy) whose total MIR-I-UV star-formation rate 
(SFR) exceeds the dust-corrected UV SFR by a factor 
of > 3, and estimate that at least ^ 50% of their sam- 
ple are Compton-thick AGN. Dey et al. (2008) select 
sources with R - [24] > 14 (/24 /xm//fl > 1000) and 
/24 fjtm > 300 /uJy, criteria which yield a sample of both 



heavily obscured AGN and star- forming galaxies. Fiore 
et al. similarly require that /24 ^m//i? > 1000, but also 
include an optical/NIR criterion of i? — i^T > 4.5 and ex- 
tend their selection to fainter 24 jim fluxes of /24 /xm > 
40 /iJy. They estimate from simulations that 80% of the 
sources selected via these criteria are obscured AGN. Pol- 
letta et al. (2008) focus only on the most luminous AGN 
(/24 /Ltm ^ 1 mJy) with large infrared to optical flux ra- 
tios, whose IRAC and MIPS colors can be described by 
the following criteria: fs.s/fs.e > 2, /8.0//4.5 > 2, and 

log [/8.0//3.6] + log[f24/f3.6] > 2. 

Because IR-excess sources tend to be optically- faint, 
we use the approach of Fiore et al. (2008, private com- 
munication) and estimate the R-band magnitudes by in- 
terpolating the ACS v- and i-band data from the MU- 
SIC catalog (Grazian et al. 2006). Despite the inher- 
ent uncertainties associated with this relatively simple 
method, the interpolated R-band magnitudes are in ex- 
cellent agreement with ground-based R-band measure- 
ments for bright sources, and they greatly improve upon 
the uncertain R-band estimations at faint flux densities. 
Because the MUSIC catalog is based on identifications 
at z and K (the latter of which is universally bright for 
the IR-excess sources), the use of the MUSIC catalog for 
these sources also ensures that the correct optical coun- 
terpart is chosen, as verified by a visual inspection of 
the sources selected via the Fiore et al. criteria. The 
only disadvantage of this method is that, of the 195.3 
sq. arcmin of our survey, only 132.7 sq. arcmin (68%) 
are covered by the deep ISAAC K-band data, which was 
also used in the selection of the Daddi et al. sources. Our 
identification of the Dey et al., Daddi et al., and Fiore et 
al. IR-excess samples is therefore limited to this region. 

Of the 465/713 sources in our MlPS-selected sample 
that lie in the ISAAC field, 10 meet the Dey et al. cri- 
teria, 52 meet the Fiore et al. criteria, and 42 lie in 
the Daddi et al. IR-excess sample (the list of which was 
kindly provided by D. Alexander, private communication 
2008). In addition, 71 MIPS sources have red IR/optical 
colors of /24 jjLTsil fn, > 1000: we will refer to this sample 
as 'IR-bright/optically-faint'. Of the full sample of 713 
sources, 5 meet the PoUetta et al. criteria. The proper- 
ties of these IR-excess galaxies, nearly all of which also 
meet the power-law or color-selection criteria outlined 
above, will be discussed in more detail in §7. 

3. REDSHIFTS 

While the redshift coverage in the CDF-S is amongst 
the highest in all cosmological fields, only 34% of the 
sources in our faint sample have spectroscopic redshifts. 
As one of our main goals is to investigate the redshift- 
dependency of infrared color selection, we require both 
accurate and complete redshift information. However, 
the mean magnitude of the sources without spectroscopic 
redshifts, F ~ 24.1, is very faint, making further spectro- 
scopic follow-up challenging. We therefore supplement 
the spectroscopic redshifts with photometric ones. The 
sources that arc the fociis of this study generally have 
SEDs with weak stellar features and require extra care 
to fit accurate photometric redshifts. To insure this ac- 
curacy, we have used two complementary photo-z ap- 
proaches and have incorporated two stages of indepen- 
dent visual inspection by two reviewers, as described in 
Appendix A. The final results, shown in Figure 1, sup- 
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Spectroscopic Redshift 

Fig. 1. — Photometric vs. secure spectroscopic redshifts for PLGs 
(navy stars), color-selected galaxies (red triangles), and IR- normal 
galaxies (green crosses). Values are plotted in log(l + z). Dot- 
dashed and dotted lines give 10% and 20% errors on Az, respec- 
tively. 89% of the sample have A(z)<0.1, and 95% have A(z)<0.2. 

port the high accuracy and completeness of the adopted 
redshifts for this faint sample. 

Of the 713 MlPS-selected sources in our sample, 249 
have secure spectroscopic redshifts from the VVDS (Le 
Fevre et al. 2004),VLT/FORS2 (Vanzella et al. 2006), 
K20 (Mignoh et al. 2005), and Szokoly et al. (2004) 
redshift surveys. For our photometric redshifts, the 
mean offset is A(z) = 0.012, where A(z) — [zp — 
-I- Zs), and the dispersion is CTz = 0.15, where 
— (1/iV) X] ^{'^Y ■ Eighty-nine percent of the sample 
have A(2:)<0.1, and 95% have A(z)<0.2. As shown in 
Table 1 , while the redshift completeness and accuracy are 
high for the IR-normal and color-selected samples, they 
drop significantly for the PLG sample. A comparison of 
their spectroscopic and photometric redshifts illustrates 
some of the issues. Three of the four PLG outliers are 
best fit by a Type 1 QSO template, whose redshift is par- 
ticularly difficult to constrain. However, of the 18 PLGs 
for which only photometric redshifts arc available, only 
2 (11%) are fit by a Type 1 QSO template. The remain- 
der show optical features/breaks that make their redshift 
determination more secure. As such, we expect the over- 
all accuracy of the PLG redshifts to be higher than one 
would assume given the limited comparison with spec- 
troscopic redshifts. 

With the addition of 400 photometric redshifts, com- 
puted as described above, our redshift completeness is 
91%. The redshift distribution is shown in Figure 2. Also 
plotted in Figure 2 is a redshift histogram that incorpo- 
rates typical errors in the photometric redshifts. To pro- 
duce this distribution, we simulated 10,000 redshift dis- 
tributions in which the photometric redshifts were ran- 
domly varied according to the appropriate a given in 
Table 1. We plot in Figure 2 the mean of the resulting 
distributions. Both redshift distributions show a strong 
peak at z = 1, and while the first shows a potential peak 
at z ~ 2, we cannot confirm its presence due to the errors 



on the photometric redshifts. 

We separate the sample in Figure 2 into the PLG, 
color-selected (Lacy et al. and Stern et al.), and IR- 
normal subsamples. As expected, the number of IR- 
normal galaxies peaks at z = 0.7 (a well-known redshift 
peak in the CDF-S) and at z = 1.1 (another known red- 
shift peak). We also detect a peak at z = 0.3 similar 
to that found by Desai et al. (2008), whose strength 
increases if we apply their cut of /24 /xm > 300 ^Jy. In 
the CDF-S, however, this peak is dwarfed by the stronger 
peak a.t z — 0.7, even at large flux densities. The redshift 
distribution of the IR-normal galaxies decreases rapidly 
at redshifts of z > 1.2, where only highly luminous star- 
forming galaxies are detectable. In contrast, the PLGs 
with redshift estimates have a relatively flat distribution 
in redshift space, as was found for the PLGs in the CDF- 
N (Donley et al. 2007). 

The redshift distributions of the (non-power-law) Lacy 
et al. and Stern et al. selected samples differ signifi- 
cantly, both from the IR-normal and PLG samples, as 
well as from one another. The Stern-selected galax- 
ies peak at z 1.25 whereas the Lacy-selected galax- 
ies show significant peaks at z ~ 0.5 and z ~ 2, with 
very few galaxies falling in the z < 1 regime originally 
probed by these selection methods (see §5). In addition, 
at z > 1.75, nearly all (94%) MlPS-selected sources meet 
the Lacy AGN selection criteria, regardless of their na- 
ture. A large concentration of galaxies aX z ^ 2 was 
previously observed in this field by Caputi et al. (2006), 
and in a brighter sample of MIPS sources by Desai et 
al. (2008), and is most probably due to the 7.7 /xm aro- 
matic feature passing through the 24 /zm band. This 
behavior suggests that the mid-IR continua of the Lacy 
color-selected galaxies contain substantial contributions 
from star-formation (e.g. Genzel et al. 1998). 

Finally, the mean redshifts of the IR-excess samples 
are as follows: z — 1.92 ± 0.36 for the Daddi et al. 
sources, z = 2.19 ± 0.61 for the dust obscured galaxies 
(DOGs, Dey et al. 2008), z = 2.09 ± 0.48 for the Fiore- 
selected sources, z = 2.11 for the one PoUetta source 
with a known redshift, and z — 2.05 ± 0.49 for the IR- 
bright/optically- faint sources. As discussed above, this 
concentration about a redshift of 2 (when not a design 
of the selection as in Daddi et al. (2007)) is likely to be 
due at least in part to the passage of the 7.7 /xm aro- 
matic feature through the MIPS 24 /im band, suggesting 
a significant contribution from star-formation. 

4. X-RAY PROPERTIES 

Of the 713 MIPS sources, 109 (15%) have X-ray coun- 
terparts in the Alexander et al. (2003) or Giacconi et 
al. (2002) catalogs. Of these, 25 are PLGs, 35 are color- 
selected galaxies (33 from the Lacy criteria, and 12 from 
the Stern criteria), and 49 are IR-normal. While the IR- 
normal galaxies therefore dominate the X-ray counts, the 
fraction of such sources is low: only 11% of the IR-normal 
galaxies have X-ray counterparts, as compared to 17% of 
the color-selected galaxies and 45% of the PLGs. 

We test for faint (> 2a) X-ray emission from the 
MIPS sources using the procedure outlined in Donley 
et al. (2005). The resulting detection fractions for the 
PLG, color-selected, IR-normal, and IR-excess samples 
are given in Table 2. With the inclusion of the weakly- 
detected X-ray sources, the detection fractions of the 
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Fig. 2. — Redshift distribution of the MlPS-selected sample. The plot on the left gives the observed distribution, while the plot on the 
right incorporates the typical errors on the photometric redshifts. Prom lightest to darkest shading, the histograms represent all MIPS 
sources in our sample, IR-normal galaxies, color-selected sources that meet the Lacy et al. (2004) criteria, color-selected sources that meet 
the Stern et al. (2005) criteria, and PLGs. 



IR-iiormal, color-selected, and PLG samples increase to 
40%, 42%, and 64%, respectively. As indicated in Table 
2, however, while all strongly and weakly-detected PLGs 
have AGN X-ray luminosities of log L^icigs s^-'^)> 42, 
the same is true for only 74% of the color-selected galax- 
ies, and 34% of the IR-normal galaxies. Given the in- 
frared luminosities implied by our 24 /im selection crite- 
rion, the portion of the sample with log Lx(ergs s^^)< 42 
will be heavily contaminated with star-forming galaxies 
(Ranalli et aL 2003). 

While the X-ray detection fraction of PLGs is relatively 
low (though significantly higher than that of the color- 
selected galaxies) , it is comparable to that of previously- 
selected samples (Alonso-Hcrrero ct al. 2006, Donley ct 
al. 2007). Of the PLGs in the 2 Ms CDF-N, 55% 'had 
high-significance X-ray counterparts. However, only 15% 
remained undetected down to the 2.5c7 detection level, 
suggesting that many PLGs are likely to be heavily ob- 
scured X-ray sources whose fluxes fall below the current 
detection limits (Donley et al. 2007). In the CDF-S, only 
45% of the PLGs have cataloged X-ray counterparts and 
36% remain undetected down to 2a. This slightly lower 
detection fraction is due at least in part to the lower X- 
ray exposure of the 1 Ms CDF-S, and will be discussed 
further in §6. 

The X-ray detection fraction of the luminous IR-excess 
sources selected via the PoUetta et al. (2008) sample is 
high (80%). However, the same can not be said for the 
remaining IR-excess samples: only 30% of the DOGs, 
19% of the Fiore et al. sources, and 15% of the sources 
with high 24 /im to optical flux ratios have cataloged 
X-ray counterparts. By definition, none of the Daddi et 
al. sources have cataloged hard X-ray counterparts, al- 
though 3 have soft-band counterparts. When the weakly- 
detected X-ray counterparts are included, these numbers 
rise to 100% for the PoUetta sources, 63% for the DOGs, 
44% for the Daddi et al. sources, 43% for the Fiore et 



al. sources, and 42% for the IR-bright /optically- faint 
sources. 

5. IRAC COLOR-COLOR SELECTION 

There are several reasons why a re-examination of the 
color-color selection criteria is needed before applying 
these techniques to our sample. First, thanks to the 
availability of high quality NIR and Spitzer MIR spec- 
tra and photometry, we can now construct more ac- 
curate MIR templates for both AGN and star-forming 
galaxies than were available when the Lacy et al. and 
Stern et al. AGN selection criteria were initially de- 
fined. Second, the AGN selection criteria of Lacy and 
Stern were initially designed for use with shallow sur- 
veys, and have not yet been properly tested over a range 
of both redshift and flux density (but see Cardamone et 
al. 2008). Third, photometric redshift techniques for 
Spitzer-detected galaxies have advanced sufficiently to 
allow nearly complete redshift estimation. 

The positions of the MlPS-selected sample in IRAC 
color space arc shown in Figure 3, where stars repre- 
sent the PLGs and circles represent the remaining MIPS 
sources. As expected, all PLGs meet the Lacy criteria 
and all but 3 meet the Stern criteria. The relatively large 
scatter of the PLGs on the Stern plot probably arises in 
part from the use of adjacent color bands, whereas the 
power-law fitting tends to smooth over noise. 



5.1. Comparison of the IRAC color-color selection 

criteria 

To illuminate the strengths and weaknesses of the two 
color-color selection criteria in Figure 3, we separate the 
sources that meet both the Lacy et al. (2004) and Stern 
et al. (2005) AGN selection criteria from those that meet 
only one of the two criteria. The color-selected sources 
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Fig. 3. — Position in Lacy et al. (2004; left) and Stern et al. (2005; right) color-space of the MlPS-selected sample, where PLGs are 
given as stars. Blue filled circles represent color-selected galaxies that meet both the Lacy et al. and Stern et al. criteria. Green triangles 
and red squares represent sources that meet only the Lacy et al. or Stern et al. criteria, respectively. The small open circles are IR-normal 
galaxies. The shaded regions represent the AGN selection regions, and the diagonal lines within are the loci of perfect power laws with 
a = -0.5 to -3.0. 

that meet both criteria primarily define an extension of 
the power-law locus to a blue slope of a = -1-0.5 (recall 
that our definition of PLGs includes only those sources 
with red slopes of a < —0.5). We will discuss in detail 
the effects of different power-law slope criteria in §6. 

The Lacy-only sources occupy two regions in color- 
space. The first, located in the lower portion of the Lacy 
diagram, corresponds to the concentration in the upper- 
left corner of the Stern et al. selection region. These 
sources were intentionally excluded from the Stern selec- 
tion region to minimize contamination from high-redshift 
{z ~ 2) star-forming galaxies. The second concentra- 
tion of Lacy-only galaxies is scattered throughout the 
star-forming locus of the Stern diagram. Are these nor- 
mal, low-z star-forming galaxies, or obscured AGN not 
selected by the Stern criteria? 

The Stern-only sources fall almost exclusively in the 
lower left portion of the Stern wedge, and occupy the re- 
gion dominated by low-redshift IR-normal galaxies in the 
Lacy et al. color diagram, suggesting that, like the Lacy 
criteria, the Stern et al. criteria are likely to suffer from 
low-redshift star-forming galaxy contamination. Because 
of the exclusion of z 2 star- forming galaxies, however, 
the Stern et al. criteria should perform better at high 
z. To test this hypothesis, we next consider the evo- 
lution in IRAC color-space of a number of high-quality 
star-forming and AGN templates. 

5.2. Star-forming Templates 

As discussed above, the availability of high-quality 
near- and mid-infrared data has allowed the construc- 
tion of high-quality star-forming SEDs, particularly for 
luminous and ultra-luminous infrared galaxies (LIRGS 
and ULIRGS). In Figure 4, we plot the redshift evolu- 
tion of these templates in IRAC color-color space over 
z = — 4, where purple and blue tracks represent the 
purely star-forming ULIRG and LIRG templates of Rieke 
et al. (2008, see Table Al), green tracks represent the 



spiral and starburst templates of PoUetta et al. (2007) 
and Dale & Helou (2002), and red tracks represent the 
elliptical templates of Silva et al. (1998). Large circles 
mark the tracks at z = 0, and small circles mark each 
integer redshift from z = 1 to 4. 

While the star-forming templates generally avoid the 
power-law locus itself, they enter the Lacy and Stern se- 
lection regions at both low and high redshifts, tracing out 
the same regions in color space occupied by many of the 
color-selected AGN, particularly those selected via only 
one of the two criteria. The templates therefore sug- 
gest potential star-forming galaxy contamination of the 
color-selected AGN, as previously predicted by Barmby 
et al. (2006), Donley et al. (2007), and Cardamone et al. 
(2008), and indicate that the current AGN selection re- 
gions may inadequately separate AGN and star-forming 
galaxies. 

Our results can be compared with the simulations of 
Sajina, Lacy, & Scott (2005), who calculated mathe- 
matical models of galaxies from three spectral compo- 
nents: stars, aromatic features, and a continuum. For 
< z < 1, we agree with their Figure 8 (upper left) 
that starburst luminosity galaxies do not significantly 
'invade' the Lacy AGN color wedge (see our Figure 4). 
However, we find that more luminous galaxies can invade 
this wedge much more seriously (again see Figure 4) . The 
difference may arise because the inputs to their models 
included few star-forming LIRGs and only one ULIRG 
(Arp 220, an atypical case). Therefore, it is likely that 
the behavior of the most luminous star forming galaxies 
is not captured as accurately in their models as is that 
of lower luminosity ones. By z = 1, the typical Spitzer 
24 /im survey sensitivity limit reaches only to the bot- 
tom of the LIRG range, so the Lacy AGN color wedge is 
more susceptible to contamination than was concluded 
by Sajina et al. (2005). 

5.3. Redshift- dependent color selection 
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intervals of 1. The power-law locus with a = —0.5 to —3.0 is shown as a line inside the shaded AGN selection regions. The star-forming 
SEDs enter the AGN selection regions at both low and high redshift. 




While the star-forming templates appear to trace quite 
well the positions of many of the color-selected galax- 
ies in color-color space, Figure 4 covers a wide range of 
redshifts (z = — 4). To understand better the over- 
lap between the star-forming templates and the color- 
selected galaxies (and PLGs), we must take the redshift 
information into account. We therefore break the sam- 
ple down into smaller redshift bins, as shown in Figures 
5 and 6. We overplot on the color-color diagrams the 
redshift-appropriate colors of purely star-forming galax- 
ies. To simplify the plots, we do not plot each galaxy 
track separately, as was done in Figure 3, but instead 
draw 1 a contours around the tracks, where a is taken to 
be the median IRAC measurement error of the full MIPS 
sample. We find that, unlike the PLGs, the majority 
of the color-selected AGN candidates fall within or very 
close to the contours for star-forming galaxies of simi- 
lar redshifts. Thus, it is likely that their mid-IR SEDs 
are dominated by star formation. This result suggests 
that simple mid-IR color-color cuts cannot identify reli- 
able AGN samples without also including redshift-based 
or additional SED (e.g. power-law) criteria. We discuss 
in Appendix B the individual redshift intervals. 

The summary of our findings can be found in Table 3, 
where we present the overall fraction of color selected 
galaxies and PLGs that lie outside the 1, 2, and 3(t 
star-forming contours. The top half of the table gives 
the fractions assuming no errors on the photometric red- 
shifts; the lower half incorporates 10% errors which for 
clarity are imposed on the templates rather than on the 
individual galaxy measurements. For the PLGs, we give 
two percentages: the fraction of sources that lie outside 
the star-forming contours in Lacy and Stern color-space, 
respectively. 

As is clear from Table 3, the fraction of color-selected 
galaxies that lie outside the star-forming contours is 
lower than that of the PLGs by a factor of 2-10 at all 
three levels of significance. For instance, while 57% - 
71% of PLGs lie more than 3(7 from the star-forming 



contours in Lacy color space, the same can be said for 
only 5%-10% of the color-selected galaxies. In addition, 
increasing the significance from la to 3(t has a far greater 
effect on the reliability of the color-selected galaxies than 
on that of the PLGs, especially in Lacy color-space, in- 
dicating that the color-selected galaxies lie noticeably 
closer to the star- forming templates than do the PLGs, 
as expected. We note that this analysis depends on the 
templates being used, as the addition or removal of any 
one star-forming template will result in changes in the 
numerical results presented above and in the table. The 
overall trends discussed above, however, will remain the 
same. 

5.4. Properties of Color- selected AGN candidates 

While the majority of color-selected AGN candidates 
lie inside the star- forming contours (see Table 3) , a num- 
ber of non-power-law color-selected galaxies meet the 
AGN selection criteria and have colors inconsistent with 
those of our star-forming templates. Are these AGN, 
as predicted? How do their redshift distributions. X-ray 
detection fractions, and numbers compare to those of 
the PLGs? In the following discussion, we define as 'se- 
cure' color-selected galaxies (and PLGs) those AGN can- 
didates that lie > la away from the redshift-appropriate 
star-forming templates. 

As discussed above, and as shown in Figure 2, the red- 
shift distributions of the color-selected sources show a 
strong peak a.t z — 2.0, attributed to the 7.7 //m aro- 
matic feature, a star-formation indicator. In contrast, 
the PLGs have a relatively flat redshift distribution, indi- 
cating little or no contribution from aromatic features. If 
we do not account for errors in the photometric redshifts, 
the resulting sample of 'secure' color-selected galaxies 
retains the large z — 2 peak, suggesting a large star- 
formation contribution. If we incorporate 10% errors on 
the redshifts of the star-forming templates when select- 
ing our secure candidates, however, the redshift distri- 
butions of the remaining color-selected galaxies become 
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Fig. 5. — Position in Lacy et al. (2004) color-space of the MlPS-selected sample, as a function of redshift, where PLGs are shown as stars, 
X-ray-cataloged sources are given by filled (red) symbols, and X-ray weakly-detected sources are given as small cyan symbols. Overplotted 
are the redshift-appropriate contours representing the IRAC colors of purely star-forming templates, assuming no errors on the photometric 
redshifts. The thick (purple) contours and dot-dashed (green) contours in the z = 1.75 — 2.25 redshift bin represent star-forming galaxy 
templates for which 10% errors were incorporated into the redshift range, and AGN templates, respectively. 



relatively flat, with mean redshifts of z = 1.59 ± 0.83 
and z — 1.62 ± 0.84 for the Lacy and Stern-selected 
sources, respectively, suggesting that a significant frac- 
tion of the high-redshift star-forming contaminants have 
been removed. For comparison, the mean redshifts and 
rms ranges of the secure PLGs selected in Lacy and Stern 
color-space are 1.98 ± 0.87 and 2.09 ± 0.78. 

While the redshift distribution therefore suggests that 
many of the star-forming contaminants have been re- 
jected from the secure sample, the X-ray detection frac- 
tions, shown in Table 4, suggest otherwise. At distances 
of 1 and 2 a from the star- forming templates, the X-ray 
detection fraction of the PLGs exceeds that of the color- 
selected galaxies by factors of > 2 — 3. This factor, how- 
ever, should be taken with reservations; X-ray sources are 
far more likely to have spectroscopic redshift estimates, 
and are therefore far more likely to be included in our se- 
cure sample, which requires redshift information. While 
this has a minimal effect on the color-selected galaxies. 



for which the redshift completeness is high, it has a large 
effect on the PLGs, boosting their probable X-ray detec- 
tion fraction. 

A better comparison is therefore with the IR-normal 
galaxy population, 11% of which are detected in the X- 
ray. The X-ray detection fractions of the secure color- 
selected AGN candidates (14-29%), exceed this value, 
but only by a factor of 1 — 3. In comparison, the 
full sample of PLGs has an X-ray detection fraction of 
45%, which exceeds that of the IR-normal galaxies by a 
factor of 4. Only at the highest significance, 3(7 from the 
star-forming contours, does the X-ray detection fraction 
of the secure color-selected galaxies approach that of the 
full PLG sample. 

This significant offset in X-ray detection fraction could 
be due either to a difference in intrinsic luminosity or to 
lingering contamination of the color-selected sample by 
star-forming galaxies. While the average X-ray luminos- 
ity of the secure PLGs, log L^{eTgs s^^)= 44.1, exceeds 
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Fig. 6. — Position in Stern et al. (2005) color-space of the MlPS-selected sample, as a function of redshift. Symbols and contours are as 
described in Figure 5. 



that of the secure Lacy and Stern-selected sources de- 
tected in the X-ray, log Lx(ergs s~^)= 43.8 and 43.9, re- 
spectively, the offset is relatively small, suggesting that 
the large discrepancy in the X-ray detection fractions is 
not driven primarily by a systematic offset in X-ray lumi- 
nosity of the relevant AGN. Instead, it likely arises from 
the inclusion of star-forming galaxies, even in the la 'se- 
cure' color-selected population, suggesting that a larger 
cut in (T (e.g. 3cr) is required to define a reasonably secure 
sample. 

Also shown in Table 4 are the total number of X-ray 
selected galaxies in both the secure color-selected and 
PLG samples. After applying completeness corrections 
for the fraction of Lacy, Stern, and PLG X-ray sources 
that have redshifts and that can therefore be included 
in the secure sample (94%, 83%, and 76%, respectively), 
the number of secure X-ray selected PLGs exceeds that 
of the secure X-ray-detected color-selected galaxies, re- 
gardless of whether we define the secure sample as those 
sources that lie 1, 2, or 3cr from the star- forming con- 
tours. This indicates that the PLG selection criterion 



identifies the overwhelming majority of secure A GN can- 
didates in IRAG color-space. 

5.5. Flux dependency of color selection 

While the Lacy and Stern AGN selection criteria were 
defined using relatively shallow surveys, these selection 
techniques are now being applied to samples with a range 
of flux densities (e.g. Gardamone et al. 2008). How does 
the limiting flux affect the completeness and reliability of 
AGN color selection? Are the problems discussed above 
present in shallow as well as deep surveys? 

There are several reasons why we might expect to see 
a shift in reliability with flux. First, the AGN fraction 
of MIPS sources depends quite strongly on the 24 /^m 
flux density (e.g. Treister et al. 2006, Brand et al. 
2006). For instance, while only ^ 4% of MIPS sources 
at our flux limit of 80 /xJy are expected to be AGN 
(Treister et al. 2006), the fraction at 5 mJy is more 
than an order of magnitude greater 45%). Therefore, 
MlPS-selected shallow surveys should contain fewer star- 
forming galaxies, although those that remain are likely 
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to be LIRGs/ULIRGs that enter the AGN selection re- 
gion in greater numbers than spirals and starbursts (see 
Figure 4). 

Second, many of the 'problem' sources that lie in- 
side the AGN color selection region but that also fall 
inside the star-forming contours have moderately high- 
redshifts (z > 1.25), and arc therefore likely to drop out 
of shallow surveys. While shallow samples may there- 
fore contain high-luminosity ULIRGS, the exclusion of 
lower-luminosity high-redshift galaxies reduces the risk 
of contamination. Other flux cuts, such as the R mag- 
nitude < 21.5 requirement of Stern et al. (2005), also 
prevent contamination at high-z, as normal galaxies at 
this magnitude are detected only to z ^ 0.6. 

To investigate the effect of intermediate flux density 
cuts on the AGN color selection, we show in Table 5 the 
fraction of 'secure' color-selected and PLGs (those that 
lie outside the la star-forming contours) as a function of 
flux density. If we do not incorporate errors in the photo- 
metric redshifts into our definition of the secure sources, 
we find that the fraction of secure sources is relatively 
constant at ~ 50% regardless of flux density. If we al- 
low errors on the photometric redshifts when construct- 
ing our 'secure' sample, the reliability of Lacy-selected 
sources at / > 80 /xJy drops to 29%, and that of Stern- 
selected sources drops to 21%. At / > 500 ^Jy, the 
fraction of secure sources amongst both samples rises, 
but only to 50%. Regardless of our assumptions, there- 
fore, the fraction of potential contaminants is still high 
(~ 50%), even at the highest fluxes probed by our survey. 
Because of the pencil-beam nature of the CDF-S survey, 
95% of the sources in our sample have /24 ^mi < 600 /iJy, 
so our ability to comment on brighter samples is limited. 

It is also worth noting that the brightest MIPS sources 
tend to lie above the power-law locus in Lacy color-space 
and to the right of the power-law locus in Stern space, 
regions where we expect minimal contamination from 
star-forming galaxies. Not surprisingly, these sources are 
also almost always detected in the X-ray, suggesting that 
these regions of IRAC color-space are the most secure. 

5.6. Comparison with previous work 

In a study of 77 AGN candidates selected from the 
Spitzer First-Look (Lacy ct al. 2005) and SWIRE sur- 
veys (Lonsdale et al. 2003) via the Lacy et al. (2004) cri- 
teria. Lacy et al. (2007) found that 33% are unobscured 
type 1 quasars, 44% are type 2 AGN. and 14% arc dust- 
reddened type 1 quasars. Only 9% have star-forming or 
LINER spectra. Is this relatively low contamination by 
star-forming galaxies consistent with our flndings? 

There are three main factors that lead to the high relia- 
bility of the Lacy et al. (2007) sample. First, the sample 
members are very bright, with a typical 24 /xm flux den- 
sity of 5 mJy. As discussed above, Treister et al. (2006) 
predict that 45% of sources at this flux density should be 
AGN, regardless of their MIR SEDs, compared to only 
4% of sources at our flux limit of 80 /iJy Indeed, of the 
brightest 50 sources in the XFLS (whose median 24 /im 
flux density is 9.4 mJy), 58% arc optically classified as 
AGN (Lacy et al. 2007). Thus, the sample from which 
these color-selected AGN was drawn contains far fewer 
star-forming galaxies than the deeper GOODS sample. 

Second, while the sources in the Lacy et al. (2007) 
sample have redshifts ranging from z = 0.053 to 4.27, the 



median rcdshift of the sample is low: z — 0.6. Only 5 
color-sclcctcd sources have redshifts in excess of z = 1.75, 
the redshift above which nearly all sources in our sample 
(AGN or star-forming) meet the Lacy et al. criteria. We 
would therefore expect little or no contamination by high- 
redshift star-forming galaxies, the predominant source of 
contamination in our faint sample. 

The third and most important reason for the high reli- 
ability of this sample, however, is the high PLG fraction 
of the Lacy-selected AGN candidates. Of the 77 sources 
in the Lacy et al. (2007) sample, 59 are PLGs as defined 
in §2. The 18 sources that are excluded by the PLG 
criteria include the only two starburst galaxies detected 
in this subsample, one starburst/LINER, an unclassified 
high-redshift galaxy with narrow UV emission lines, two 
composite galaxies, 8 Type 2 AGN (the identification of 
4 of which were based on a BPT analysis), 3 reddened 
Type 1 AGN, and 1 Type 1 AGN. If we relax the prob- 
ability constraint of the PLG criterion to Px > 0.01, as 
was done in Alonso-Herrero et al. (2006), we recover 69 
of the 77 Lacy et al. (2007) sources selected for opti- 
cal follow-up. Excluded arc the two starburst galaxies, 
one high-z unidentified galaxy, three Type 2 AGN, one 
reddened Type 1 AGN, and 1 Type 1 AGN whose slope 
of Q = —0.41 falls just short of our cut of a < —0.5. 
The high reliability of this luminous color-selected sam- 
ple is therefore consistent with our findings in §6 that 
while non-PLG color-selected AGN are subject to con- 
tamination by star-forming galaxies, sources selected via 
a power-law criterion are reliable. 

6. IRAC POWER-LAW SELECTION 

Thus far, we have focused primarily on the reliability 
of the AGN color selection criteria of Lacy et al. (2004) 
and Stern et al. (2005). In doing so, we have separated 
out the subset of sources that meet the power-law criteria 
of Alonso-Herrero et al. (2006) and Donley et al. (2007). 
Here, we discuss the PLG selection itself, in particular 
the consequences of various choices of limiting power-law 
slope, a. 

Our default cut of a < —0.5 was chosen to match the 
spectral indices of typical AGN (e.g. Alonso-Herrero et 
al. 2006, Donley et al. 2007). In the optical, AGN 
have spectral slopes of a = 0.5 to -2 (SDSS, Ivezic et 
al. 2002), with a mean value of a ~ —1 (Neugebauer et 
al. 1979, Elvis et al. 1994). In the IRAC bands, broad- 
line AGN exhibit similar slopes, with a mean value of 
a = -1.07 ± 0.53 (Stern et al. 2005). 

In Figures 7 and 8, we plot PLGs selected via cuts in a 
ranging from -1-0.5 to -1.5. At high redshift {z ~ 1.5 — 2) 
the IRAC bands trace the blue side of the stellar bump. 
It is therefore not surprising that at the bluest slopes 
(a = -1-0.5), the PLG sample is dominated by the pop- 
ulation of high redshift X-ray-non-detected star-forming 
galaxies discussed above. As the required slope red- 
dens towards our cut of a < —0.5, the high-redshift 
star-forming galaxies gradually drop out of the sam- 
ple, and the X-ray detection fractions rise from 21% 
at a = +0.5 to 30%, 45%, 67%, 88%, and 80% at 
a = 0.0,-0.5,-1.0,-1.5, and -2.0. While choosing a 
redder cut in a therefore increases the apparent reliabil- 
ity of the PLG selection, it also decreases the number 
of galaxies selected, and may exclude interesting heav- 
ily obscured. X-ray-non-detected AGN like those seen in 
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Fig. 7. — Position in Lacy et al. (2004) color space of PLGs as 
a function of the power-law slope cut, a. X-ray-detected sources 
are shown as filled (red) symbols. 



Fig. 8. — Position in Stern et al. (2004) color space of PLGs as 
a function of the power-law slope cut, a. X-ray-detected sources 
are shown as filled (red) symbols. 



the CDF-N (e.g. Donley et al. 2007). 

The X-ray-non-detected PLGs in the current sample 
tend to be the faintest sources both in the IRAC bands 
and at 24 fim, where their mean flux density, 146 /iJy, is 
over a factor of 2 lower than that of the X-ray-detected 
sample, 334 /itJy. The low X-ray detection fraction of 
these faint sources may therefore be due simply to their 
systematically lower fluxes. At z > 2.6 — 2.9, however, 
the star-forming ULIRG templates have IRAC SEDs that 
meet the PLG criteria, although dropping the power-law 
slope criterion to a < —1.0 and a < —1.5 raises this red- 
shift range to z > 2.74—3.92 (depending on the template) 
and to z > 3.4 - 4.2, respectively (with the IRAS 22491- 
1808 template never reaching an a of -1.5). It is therefore 
possible that the X-ray non-detected PLGs (which tend 
not to have redshift estimates, to be faint, and to lie at 
relatively high a) are high-redshift star- forming galaxies. 
To test this hypothesis, we plot in Figure 9 the ratio of 
the MIPS 24 ^ni flux density to that of the 3.6 ^m IRAC 
band. At the redshifts of interest (z > 2.6), this flux ra- 
tio allows a direct comparison of the hot dust emission 
at 5-7 /im to the stellar emission at ^ 1 fim. Overplot- 



ted on the colors of our MlPS-selected sample are the 
redshifted colors of the AGN templates of PoUetta et al. 
(2007, see Table 1) and the purely star-forming LIRG 
and ULIRG templates of Rieke et al. (2008). The mean 
colors of PLGs without redshift estimates are given by 
large symbols placed at z = 2.6, the redshift above which 
contamination by star-forming galaxies is possible. The 
circle, square, and triangle represent all PLGs without 
redshifts, those that are detected in the X-ray, and those 
that are not, respectively. 

As can be seen in Figure 9, the X-ray-detected and X- 
ray-non-detected PLGs lacking redshifts have consistent 
24 /im to 3.6 fim colors, suggesting that there is no signif- 
icant difference between these two sub-samples. In addi- 
tion, while the colors of these sources are consistent with 
those of AGN, they lie well above those of star-forming 
galaxies, not only at high redshift, but at all redshifts 
greater than z ^ 0.6. In other words, the PLGs that are 
undetected in the X-ray and that lack redshifts (prevent- 
ing us from testing their reliability as in §5.3) appear to 
have more hot dust emission than can be explained by 
purely star-forming templates, especially at high-z where 
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Fig. 9. — Observed 24 fim to 3.6 fim color as a function of 
redshift. The tracks represent the AGN templates of PoUetta et al. 
(2007, red) (see Table Al), and the purely star-forming LIRG and 
ULIRG templates of Rieke et al. (2008, cyan). Triangles represent 
all sources in our sample with redshift estimates, and filled (red) 
triangles indicate a PLG source. The mean colors of PLGs without 
redshift estimates are given by a large circle (all PLGs), a large 
square (X-ray— detected PLGs), and a large triangle (X-ray-non- 
detected PLGs). We arbitrarily placed these mean colors at a 
redshift of 2 = 2.6, above which contamination of the PLG sample 
by star-forming galaxies is possible. The colors of these sources, 
however, are consistent with AGN, not star-forming galaxies. 

star-forming contamination of the PLG sample is most 
likely. This result suggests that these sources are more 
likely to be AGN than star- forming galaxies. Other lines 
of evidence, e.g. variability (Klesman & Sarajedini 2007), 
and X-ray properties (Steffen et al. 2007), tend to sup- 
port this conclusion. In addition, we note from Figure 9 
that a number of the other galaxies at z ^ 2 have col- 
ors slightly redder than those predicted by star-forming 
galaxies at this redshift. This behavior may be due to a 
minor issue with the templates, a small AGN contribu- 
tion to the MIR flux density, or to reddening that exceeds 
that seen in our local LIRG/ULIRG templates. 

Finally, the segregation of the X-ray-detected and 
X-ray-non-detected sources about the power-law locus, 
seen in Figures 7 and 8, warrants discussion, as this was 
not seen in the CDF-N (Donley et al. 2007). This be- 
havior appears to be due largely to the different selection 
methods and limiting fluxes of the two samples. While 
the CDF-N PLG sample was selected on the basis of 
IRAC fluxes, the current sample was selected from a 
flux-limited MIPS sample with far deeper IRAC data. 
If we require the CDF-S sample to meet the IRAC de- 
tection limits of the CDF-N study, the PLG sample size 
decreases by a factor of 2 and the X-ray detection fraction 
rises to 78%. The segregation in the PLG sample is also 
greatly reduced, due largely to the loss of many of the 
X-ray-non-detected sources. This high X-ray detection 
fraction of 78%, however, is surprising as it is signifi- 
cantly higher than the 55% found for the CDF-N PLG 
sample. To test whether this change is a result of the 
much improved IRAC data, we recalculated the spectral 
properties of the galaxies in the CDF-N using the most 



current IRAC data, which now includes the super-deep 
GOODS-N coverage not previously available. Drawing 
PLGs from the same initial sample used in Donley et al. 
(2007), we find an updated X-ray detection fraction of 
57%, a value nearly identical to that found previously. 
For consistency with the current sample, we further re- 
stricted the PLG sample to those sources detected in the 
super-deep GOODS field, where the X-ray exposure is 
generally higher. Doing so raises the X-ray detection 
fraction to 65%. The remaining offset in the X-ray de- 
tection fractions of the IRAC flux-limited PLGs in the 
GOODS-N and GOODS-S fields, 13%, therefore appears 
to be due primarily to cosmic variance and small number 
statistics. 

7. IR-EXCESS SELECTION 

Both high-redshift star-forming galaxies and AGN can 
exhibit extremely red infrared to optical colors when 
heavily dust-obscured (e.g. Dey et al. 2008). While 
the relative fractions of AGN and star-forming galaxies 
among IR-excess samples is still a matter of debate, the 
fraction of dust-obscured galaxies (DOGs) with AGN- 
like power-law SEDs has been shown to decrease with 
decreasing flux from ^ 70% at / = 1 mJy to '-^ 20 — 25% 
at / < 300 ^Jy (Dey et al. 2008). We might there- 
fore expect a relatively small AGN fraction amongst the 
faint IR-excess sources in the GOODs fields, whose mean 
24 /im flux is 215 /xJy. Fiore et al. (2008), however, esti- 
mate that 80% of their red, IR-excess galaxies are heav- 
ily obscured, Compton-thick AGN. Here, we probe the 
nature of the IR-excess sources and examine the over- 
lap between the IR-excess galaxies and the X-ray, color- 
selected, and power-law samples discussed above. 

The positions of the IR-excess galaxies in Lacy and 
Stern IRAC color-space are shown in Figure 10, and their 
overlap with the X-ray, PLG, and color-selected samples 
is given in Table 2. Of the 101 IR-excess sources, 24 
(24%) are PLGs and all but 7 meet either the Lacy, 
Stern, or power-law criteria. At least 1/4 of the IR- 
excess galaxies therefore show evidence for AGN-heated 
dust (with the fraction rising to 100% for the luminous 
AGN selected via the PoUetta et al. (2008) criteria, as 
expected). Not all PLGs, however, have IR-excess col- 
ors. Of the 55 PLGs, only 44% meet one or more of 
the IR-excess criteria. The same can be said for 33% of 
the IRAC color-selected galaxies, but for only 2% of the 
IR-normal galaxies. 

Of the IR-excess sources, only 15% have cataloged X- 
ray counterparts, with the fraction rising to 45% when 
weak (> 2a) X-ray detections are considered (see Table 
2). Of the X-ray-cataloged sources, 47% are PLGs, 47% 
are color-selected galaxies, and 1 (6%) is IR-normal. The 
statistics are quite different for those sources with no 
cataloged X-ray emission, where the power-law fraction 
drops to 19% and the color-selected fraction rises to 73%. 

While a strong argument can be made for the AGN 
nature of the 32 IR-excess sources with cataloged X-ray 
counterparts and/or PLG SEDs, what can be said about 
the remaining IR-excess sources? As discussed in §3, 
the concentration of the redshifts of these sources about 
z — 2 may be due to the passage of the 7.7 fim aromatic 
feature through the 24 /im band, suggesting a significant 
contribution from star-formation. In addition, the vast 
majority of color-selected sources at z = 2 (of which 
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Fig. 10. — Position in Lacy et al. (left) and Stern et al. (right) 1 

these are a subset) have IRAC colors consistent with 
those of star-forming galaxies or low-luminosity AGN 
whose IRAC SEDs are dominated by the host galaxy. 
In combination with the lack of significant X-ray emis- 
sion, these facts suggest that the remaining 68% of the 
IR-excess sample could be either star-forming galaxies, or 
extremely obscured AGN. To distinguish between these 
two possibilities, we consider in more detail the sources 
selected via the Daddi et al. and Fiore et al. criteria. 



7.1. Daddi et al. (2007) Compton-thick AGN 
candidates 

Of the 88 IR-excess AGN candidates selected in the 
ISAAC region of GOODS-S by Daddi et al. (2007a), the 
list of which was kindly provided by D. Alexander (pri- 
vate communication, 2008), 42 fall in our MlPS-selected 
sample. Twenty-three of the remaining 46 galaxies have 
MIPS fluxes that fall below our cut of /24 /im > 80 ^Jy, 
and 6 galaxies lack MIPS and/or IRAC counterparts in 
our catalogs, indicating that they too are faint. Seven 
galaxies were excluded because of observable blending in 
the MIPS or IRAC bands, and an additional 10 were re- 
moved from our sample because of multiple optical coun- 
terparts. 

The positions of the Daddi et al. sources in IRAC 
color-space are shown in Figure 10. Thirty-eight of the 
42 Daddi et al. sources in our sample meet the Lacy 
et al. IRAC color-selection criteria. Only 10 meet the 
Stern et al. AGN selection criterion, and only 5 are 
PLCs (4 of which have relatively shallow slopes of only 
a ~ —0.6). In addition, only 15 (36%) meet any of the 
other IR-excess criteria. This is not surprising, as the 
MIR emission from the Daddi et al. IR-excess sources 
exceeds that of a typical star-forming galaxy by a factor 
of only > 3 (Daddi et al. 2007b). Of the 42 IR-excess 
galaxies, 18 have IRAC colors that lie > Icr from the 
redshift-appropriate star-forming contours in either Lacy 
or Stern color-space. Only 3 and 1, however, have colors 
that lie outside the 2(j and 3cr contours, respectively. As 
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discussed above, however, this indicates only that any 
AGN activity can not be identified on the basis of the 
MIR IRAC colors alone. 

To determine the nature of the Daddi et al. (2007) 
sources in our sample, we therefore turn to the X-ray 
data. While none of the IR-excess galaxies arc individ- 
ually detected in the hard X-ray band (by definition), 
3 have soft X-ray detections and 13 have faint (> 2(j) 
X-ray counterparts. Twenty remain X-ray-undetected, 
and 6 sources have a nearby X-ray counterpart that pre- 
vents an accurate test for low-o" X-ray flux. Using the 
procedure outhned in Steffen et al. (2007), with the 
only change being our slightly different choice of source 
aperture radius, 2", we coadded the three sources de- 
tected in the soft band, whose soft-band luminosities of 
Lx > 10^^ ergs s^^ indicate that they are AGN. We ver- 
ify that our stacking method reproduces the results of 
Daddi et al. (2007b) for the same sample of 59 IR-excess 
galaxies used in that work. While the coaddition of the 
3 soft X-ray detected sources did not lead to a hard band 
detection, we place a 3cr limit on the hard-band flux that 
constrains their column density to A^h x 10^^ cm~^, 
assuming an intrinsic photon index of F = 1.8. 

A coaddition of the 13 weakly-detected sources leads 
to a 3.3(7 hard-band detection, a 9.7(7 soft-band detec- 
tion, a hardness ratio of HR = —0.31, and a photon 
index of F = 1.4. If we assume that these sources are 
AGN at their mean redshift of 2 = 1.81, the hard to soft 
flux ratio corresponds to an obscured column density of 
TVh = 3.6 X 10^^ cm~^. At this modest obscured yet 
Compton-thin column density, the observed soft band 
flux is only attenuated by a factor of 2, implying that 
the sources must have relatively low luminosities. In- 
deed, the rest-frame 2-10 keV absorption-corrected lumi- 
nosity derived from the observed soft-band flux is only 
ix = 1-6 X 10'*2 ergs s'^. 

However, Daddi et al. (2007b) argue that the coad- 
ded soft X-ray flux of their full IR-excess sample can be 
attributed to star-formation. When they subtract this 
component from the detected fluxes, the hardness of the 
remaining X-ray emission implies a significantly larger 
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Fig. 11. — Rest-frame 2-10 keV luminosity vs. rest-frame 25 /xm 
power. The small symbols represent the staxburst and ULIRG 
samples of Pranceschini et al. (2003), Ranalli et al. (2003), and 
Persic et al. (2004), where starburst dominated sources are given 
as squares and circles, and AGN-dominated sources are given by 
crosses. The solid line gives the best-fit linear relationship between 
the X-ray and 25 fim luminosities of the star-formation-dominated 
sources. The large filled and open stars represent the full IR-excess 
and IR-normal samples from Daddi ct al. (2007b), respectively, and 
the large squares represent the X-ray weakly- and non-dctcctcd 
members of the Daddi ct al. sources in our sample. Because these 
sources comprise the brighter subset of the full Daddi et al. sample, 
we treat the full 25 fim flux density (derived from the coaddcd 
70 fim flux density given by Daddi et al. (2007b)) as a lower-limit. 

(e.g. Compton-thick) column density as well as a larger 
absorption-corrected luminosity. 

To test the origin of the X-ray emission, wc plot in Fig- 
ure 11 the 2-10 keV luminosity and 25 /im power {vLv) of 
a sample of starburst and AGN-dominated galaxies and 
ULIRGs drawn from Franccschini et al. (2003), Ranalli 
et al. (2003), and Persic et al. (2004). The 25 ^im lu- 
minosities were extracted from the IRAS Revised Bright 
Galaxy Sample (RBGS, Sanders et al. 2003) when avail- 
able, and from ISO (Klaas et al. 2001) or the IRAS 
Faint Source Catalog otherwise. As both the hard X-ray 
and MIR flux densities of star-forming galaxies trace the 
current star-formation rate (e.g. Ranalli et al. 2003, 
Pranceschini et al. 2003), these two luminosities are 
well correlated for starbursts and starburst-dominated 
ULIRGS (though there is a hint of a turnover to lower 
X-ray luminosities amongst the ULIRG sample). AGN, 
however, show an increased X-ray output for their ob- 
served 25 jjLVSi flux density. 

To compare the Daddi et al. sample to these local star- 
burst and ULIRG samples, we convert the observed soft- 
band (rest frame ~ 1.5—6 keV) luminosity to a rest-frame 
2-10 keV luminosity, and the observed 70 fim (rest-frame 
~ 23 (Um) flux density given by Daddi et al. (2007b) to 
a rest-frame 25 /xm power. The mean luminosities of 
the full Daddi et al. (2007b) IR-normal and IR-cxccss 
samples fall on the best-fit correlation for star-forming 
galaxies, confirming that their observed soft X-ray flux 
can be attributed to star-formation. The X-ray lumi- 
nosity of the X-ray weakly-detected IR-excess galaxies 
in our MIPS sample, however, is somewhat higher than 
predicted. The AGN origin of the soft X-ray emission is 



supported by the 2-10 keV luminosity implied by the soft 
X-ray flux, Lx = 1.6 x lO''^ ergs s^-^. If the soft X-ray 
flux has a signiflcant contribution from the AGN, then it 
cannot all be subtracted from the AGN spectrum, result- 
ing in column densities lower than estimated by Daddi 
et al. (2007b). 

However, the indicated AGN contribution is modest, 
and is based on an average. Therefore, it is likely 
that there will be a significant range of star-formation- 
corrected absorbing columns, including some that are 
Compton thick. Indeed, Alexander et al. (2008) show 
that a sample of 6 spectroscopically-confirmed Compton- 
thick AGN would be selected via the Daddi et al. (2007) 
method, although 4 of the 6 have infrared excesses of a 
factor of > 100, and thus represent the most extreme 
IR-excess sources. 

A coaddition of the 20 X-ray non-detected Daddi ct 
al. sources in our sample leads only to a marginal 2. Oct 
detection in the full band (0.5-8 keV), and 1.8cr and l.Ocr 
detections in the hard and soft bands, respectively. A 
3(7 limit on the soft flux gives a 2-10 keV luminosity of 
Lx < 3.8 X 10**^ ergs s"-'^, fully consistent with a star- 
forming origin (see Figure 11). (The 2-10 keV luminos- 
ity derived from the marginal full-band detection is even 
lower: Lx < 2.2 x 10''^ ergs s~^). The near-detection 
in the hard-band, however, suggests that these sources 
may have relatively hard X-ray spectra. By coadding the 
weakly and non-detected sources, the photon index drops 
to r = 1.04tE] j^, from a value of P = l-'i'^tol) mea- 
sured for the weakly-detected sources. There is there- 
fore marginal (Icr) evidence that the X-ray non-detected 
sources have spectra harder than those of their weakly- 
detected counterparts. Does this hard spectrum conflrm 
that these sources are AGN, or might there be another 
explanation? 

7.1.1. A Star- formation Origin? 

While it is well-known that heavily obscured AGN ex- 
hibit hard X-ray spectra, star-forming galaxies can also 
produce such spectra. As discussed in Persic & Rephaeli 
(2002) and Persic et al. (2004), the hard (> 2 - 10 keV) 
X-ray emission of star-forming galaxies is dominated by 
low and high-mass X-ray binaries (LMXBs, HMXBs), 
with the HMXB fraction increasing from ~ 20% at star- 
burst luminosities to ~ 100% at ULIRG luminosities. As 
shown by White, Swank, & Holt (1983), HMXBs have X- 
ray spectra with P = 1.2 ± 0.2, a cutoff energy of 20 keV, 
and an e-folding energy of ~ 12 keV (see e.g. Persic & 
Rephaeli 2002). At z = 2, the 0.5-2 keV and 2-8 keV 
X-ray bands sample the rest-frame 1.5-6 keV and 6-24 
keV X-ray bands, and therefore should be minimally af- 
fected by the power-law cutoff. A luminous star-forming 
galaxy undergoing an isolated burst of star-formation is 
therefore expected to display a hard X-ray spectrum of 
P = 1.0 — 1.4 over the energies observed in our sample. 
X-ray binary emission has been proposed as an explana- 
tion for the hard (P ~ 1.0) spectrum of Arp 220 (Iwasawa 
et al. 2001), the spectra of the apparently starburst- 
dominated ULIRGS of Ptak et al. (2003) and Teng et al. 
(2004), whose photon indices tend to lie at P = 1.0— 1.5, 
and the starburst-dominated ULIRGS of Pranceschini et 
al. (2003). 

While the hard X-ray properties of the IR-excess 
sources do not therefore require an AGN origin, can the 
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Fig. 12. — Median SEDs of the full Daddi ct al. (2007a,b) IR-normal and IR-cxccss GOODS-S samples, as given in Figure 4 of Daddi 
et al. (2007b). Overplotted are the purely star-forming SEDs of the starburst galaxy M82 (left) and LIRG NGC 3256 (right) (The Rieke 
et al. (2008) templates only extend down to ~ 0.4 ^m). We have applied an additional reddening of Ay = 0.8 to the NGC 3256 template. 
The open diamonds represents the flux that would be measured in the MIPS 24 fim band when the bandpass is convolved with the SED. 
The optical-MIR SEDs of both the IR-normal and IR-excess sources are well-fit by star-forming SEDs. 



same be said of their MIR properties? To test the na- 
ture of these sources, wc first plot in Figtirc 12 the me- 
dian SEDs of the full GOODS-S IR-normal and IR-cxcess 
samples, as given in Figure 4 of Daddi et al. (2007b). 
Although these high-redshift sources appear to be highly 
luminous (see Figure 11), there are indications that local 
templates for lower-luminosity galaxies are appropriate 
for them, at least in the optical-MIR (Rigby et al. 2008). 
It is therefore not surprising that the optical-MIR SED 
of the IR-normal galaxies is well-fit by the M82 SB tem- 
plate of Polletta et al. (2007). (The discrepancy between 
the observed SED and template at shorter wavelengths is 
likely due to a difference in the reddening and/or age of 
the stellar population.) The IR-excess SED is also very 
well fit by the purely star-forming template of the LIRG 
NGC 3256 (Rieke et al. 2008), to which we have appUed 
an additional reddening of Ay ^ 0.8. While this does 
not rule out an AGN contribution to the Daddi et al. 
IR-excess sample, it does indicate that an AGN need not 
be present to produce the observed SEDs of the IR-excess 
sources, and that their IR-excesses may simply be due to 
strong aromatic emission associated with their system- 
atically higher IR luminosities, as suggested by Daddi et 
al. (2d07a). 

If these IR-excess sources are dominated by star- 
formation, then it appears either that their IR SFRs have 
been overestimated or that their UV-derived SFRs have 
been underestimated. The former scenario could be par- 
tially attributed to the larger than unity slope between 
24 /xm luminosity and Paa-derived SFR (Alonso-Herrero 
et al. 2006, Calzetti et al. 2007), which indicates that as 
the SFR increases, an increasing fraction of the result- 
ing light is emitted in the MIR. A simple proportional 
relationship between SFR and IR luminosity (e.g. that 
of Kennicutt 1998) will therefore increasingly overpredict 
the SFR for more luminous infrared galaxies. 

An underestimate in the UV-derived star-formation 
rates could be due to the inherent difficulties in deter- 
mining accurate UV SFRs for luminous, heavily obscured 



galaxies (e.g. Goldader et al. 2002, Buat et al. 2005). 

While UV extinction is known to correlate with lumi- 
nosity (see Goldader et al. 2002; Vijh, Witt, & Gordon 
2003; Buat et al. 2005), Daddi et al' (2007a) find no dif- 
ference between the average derived Ai^oo values of the 
IR-normal and IR-excess samples, despite the systemat- 
ically higher 8 /im luminosities of the IR-excess sample 
(see Figure 2 of Daddi et al. 2007b). Under the as- 
sumption that the radio emission arises purely from star- 
formation, the radio luminosities of the 16 radio-detected 
IR-excess galaxies similarly indicate the UV SFRs have 
been underestimated by a mean factor of 6.4, and a me- 
dian factor of 3.5 (see Figure 12 of Daddi et al. 2007a). 
A stack of all IR-excess galaxies, however, leads to radio 
SFRs that are consistent with those derived in the UV. 

7.1.2. Summary 

We have divided the 42 Daddi et al. IR-excess galax- 
ies in our MlPS-selected sample into three subsamples: 
those that are X-ray detected (3), those that are weakly- 
detected in the X-ray (13), and those that remain unde- 
tected in the X-ray down to 2a (20). (The remaining 6 
sotirces lie too close to an X-ray source to test for faint 
emission.) The 3 X-ray detected sources are AGN with 
Lx > 10**^ ergs s~^, but have relatively low obscuration 
(iVn < 2 X 10^2 cm-2). 

A coaddition of the weakly-detected sources leads to a 
hard (F = 1.4) X-ray detection. If these sources are ob- 
scured AGN, the hardness ratio implies a column density 
of A'^H = 3.6 X 10^2 cm~2. A hard detection, however, 
could be attributed either to obscured AGN activity or 
to star-formation via HMXBs. Furthermore, the median 
SED of the ftdl Daddi et al. (2007b) IR-excess sam- 
ple is consistent with that of a local star- forming LIRG, 
suggesting that the MIR emission alone also cannot be 
used to rule out a star-forming origin. A comparison of 
the X-ray and MIR luminosities of these weakly-detected 
sources does not conclusively distinguish between a star- 
forming or AGN origin for the X-ray emission. There 
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arc therefore 3 possible explanations for the members of 
this sample of IR-excess galaxies: (1) the sources are 
Compton-thick AGN whose soft X-ray emission can be 
attributed to star-formation and whose hard X-ray emis- 
sion comes from the AGN, (2) they are relatively low- 
luminosity, Compton-thin AGN, whose soft X-ray emis- 
sion can not be entirely attributed to star-formation, or 
(3) they are star-forming galaxies whose soft and hard 
X-ray emission arise from star-formation. While the de- 
rived 2-10 keV X-ray luminosity of ~ 10"*^ ergs s~^ sug- 
gests explanation (2), it is not sufficiently high to defini- 
tively rule out the remaining scenarios. It is likely that 
no single possibility applies to all 13 galaxies, but that 
the possibilities define the range of their behavior. 

The X-ray non-detected sources, when coaddcd alone 
and with the X-ray weakly-detected sample, also appear 
to have a relatively hard X-ray spectrum. Their X-ray 
luminosity, however, is significantly lower than that of 
the weakly-detected sources, < 3.8 x 10^^ ergs s"""^. 
If the hard spectrum arises from HMXBs, the properties 
of this sample would be consistent with a star-formation 
origin. 

In summary, while we cannot rule out an obscured 
AGN origin for the Daddi et al. IR-excess sources in 
our MlPS-selected sample, their properties may also be 
consistent with a purely star-formation origin in the ma- 
jority of cases. Further analysis, taking into account the 
additional 1 Ms of X-ray exposure in the CDF-S, is there- 
fore required to establish their nature. 

7.2. Fiore et al. (2008) Compton-thick AGN candidates 

Fiore et al. (2008) select IR-excess galaxies with the 
following properties: /24 nm/fR > 1000 and R — K > 
4.5. Using the MUSIC catalogs (see §2.3), we selected 
64 sources that meet these criteria and that have 24 
flux densities > 80 fiJy. However, 9 of these sources 
were removed from our MIPS sample because of visible 
blending in the MIPS and/or IRAC bands (see §2), which 
may have been responsible for their anomalously high 
IR to optical flux ratios (unlike Grazian et al. (2006), 
we do not attempt to de-blend such sources). A visual 
inspection of the remaining sources led to the removal of 
3 additional sources with blended K-band fluxes, leaving 
52 high-quality IR-excess sources. 

The flrst Fiore criterion was designed to select obscured 
AGN with large X-ray to optical flux ratios (see their 
Figure 2) whose column densities tend to range from 
ATjj = 1022 to 10^^'' cm-2 (sec Fiore et al. 2008 and ref- 
erences therein) . The R ~ K criterion ensures that only 
extremely red objects (EROs) fall in the sample. The 
AGN amongst ERO samples tend also to be X-ray ob- 
scured with iVn = 10^2 - 1024 cm-2 (Brusa et al. 2005). 
Obscured AGN are therefore likely to be targeted by 
these criteria. However, these two selection criteria are 
known to identify both AGN and star-forming galaxies 
(e.g. Alexander et al. 2002, Doherty et al. 2005, Dey 
et al. 2008). We therefore examine whether these cri- 
teria are sufHciently stringent to exclude the possibility 
that the properties of these sources could arise from star 
formation. 

The first of the three main arguments for the Compton- 
thick AGN nature of the Fiore et al. sources is the abil- 
ity of the obscured AGN template of IRAS 09104-^41091 
(Pozzi et al. 2007) to reproduce the extreme colors of 




Fig. 13. — Redshifted (z = 1.5 — 2.5) tracks of the star-forming 
template IRAS 22491-1808 in Fiore et al. color space. Additional 
extinctions of Ay =0.0, 0.2, 0.4, 0.6, 0.8, 1.0, and 1.2 were applied 
via the SMC (solid, Prevot et al. 1984, Bouchet et al. 1985) or 
Calzetti (dashed, Calzetti et al. 2000) extinction laws. The dotted 
lines represent the Fiore et al. (2008) selection criteria, and circles 
represent the 52 sources in our sample that meet those criteria. 
The 10 X-ray— detected sources are given as filled circles. The star- 
forming template enters the selection region with only a modest 
(Ay ~ 1) amount of additional extinction. 

these sources, and the comparable inability of the M82 
and Arp 220 star-forming templates to do the same (see 
their Figure 3). When fitting SEDs to the sources in 
their sample, Fiore et al. find that only 36% of their X- 
ray non-detected sources are best-fit by elliptical, spiral, 
M82, N6090, or Arp 220 star-forming templates. Of the 
sources in our MlPS-selected sample that meet the Fiore 
et al. criteria and have good redshift fits, we find an even 
lower fraction of sources best-fit by these templates: 20%. 
However, as shown in Figure 13, the purely star-forming 
IRAS 22491 template of PoUetta et al. (2007) satisfies 
the Fiore et al. criteria when modest additional redden- 
ing (^v < 1-2) is allowed, indicating that the extreme 
colors of these sources can be reproduced not only by ob- 
scured AGN, but by highly reddened star-formation as 
well. Indeed, when we include this star-forming tem- 
plate, as well as the star-forming LIRG/ULIRG tem- 
plates of Rieke et al. (2008), the fraction of Fiore sources 
in our sample best-fit by a star-forming template rises 
from 20% to 66%. 

The second argument for the Compton-thick nature of 
the Fiore et al. sources is the high fraction of heavily 
obscured AGN (A^h = lO^'"' - lO^*^ cm^^) required to 
reproduce the hardness ratios and counts of the stacked 
X-ray emission. Using a simulation, Fiore et al. (2007) 
estimate that 80% of their sources are obscured AGN. 
The prediction for the X-ray non-cataloged sources in 
our sample is somewhat lower: 60%^^g^, although this 
fraction rises for the weakly-detected IR-excess sources 
to 80%+2ol private communication, 2008). 

To test this finding for the sources in our sample, we 
coadded the 36 X-ray non-cataloged Fiore et al. sources 
that lie far enough from known X-ray sources to allow an 
accurate test for faint X-ray emission. The coaddition 
gave a soft detection (0.5-2 keV, 3.9a), but only a weak 
hard detection (2-8 keV, 1.7o-). The resulting hardness 
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ratio, HR^ {H S)/{H + S) = -0.17 is only slightly 
harder than that of the 10 X-ray cataloged Fiore et al. 
sources, HR = —0.21, whose column densities fall in the 
unobscured to highly obscured (log Ny^ (cm~^)= 23.9) 
range (Tozzi et al. 2006), with a median value of log 
iVn (cm-2)= 22.6). 

If wc stack only the 10 soiirccs with weak X-ray coun- 
terparts (that also lie sufhciently far from known X- 
ray sources), however, we find a 5.3ct hard-band detec- 
tion, a 4.7(7 soft-band detection, and a hardness ratio of 
HR = 0.33, significantly harder than that of the X-ray 
cataloged sample. At a rcdshift of z = 2, this HR corre- 
sponds to a column density of A^h = 2.7x 10^'^ cm^^. The 
observed photon index, F = 0.33, indicates an obscured 
AGN origin, as it is inconsistent even with the mod- 
erately hard spectra of HMXB-dominated star-forming 
galaxies (F ^ 1.0 — 1.4). Stacking the remaining 26 X- 
ray-non-detcctcd Fiore sources does not lead to a detec- 
tion in any band. It therefore appears plausible, at least 
in this bright subset of the Fiore et al. sample, that the 
hard X-ray flux can be attributed to a small number of 
obscured, yet mostly Compton-thin, AGN, as opposed to 
a large number of obscured, Compton-thick AGN. 

The third argument for the AGN nature of the Fiore et 
al. sources is the significant (factor of 30) offset between 
the IR and UV-derived SFRs. However, the inherent dif- 
ficulties in determining accurate UV SFRs for luminous, 
heavily obscured galaxies (e.g. Goldadcr et al. 2002, 
Buat et al. 2005), as well as the systematic uncertainties 
of factors of 10-30 in the TIR luminosity (Fiore et al. 
2008), make this the weakest of the 3 arguments. 

Of the 52 Fiore et al. sources in our MIPS sam- 
ple, 10 (19%) are therefore X-ray-selected AGN, and 10 
(19%) are weakly detected in the X-ray, with coadded 
properties consistent with their being obscured AGN. As 
for the remaining 26 sources for which we can test for 
faint X-ray emission, we cannot rule out the presence of 
Compton-thick AGN. However, the lack of coadded hard 
counts from this X-ray non-detected sample, the signif- 
icant fraction of such sources that can be fit by star- 
forming templates, and the Compton-thin nature of the 
X-ray-detected Fiore et al. sources all suggest that many 
of these sources, which make up the remaining 56% of the 
sample, are instead star-forming galaxies. 

8. IMPLICATIONS FOR IR SELECTION OF AGN 

Our evaluation of the performance of the infrared selec- 
tion methods allows a preliminary estimate of the overall 

role of 5piteer-discovered AGN in the total population. 
Of the 109 X-ray sources in the MlPS-selectcd sample, 
95 have redshifts, and of these, 73 (77%) have AGN-like 
X-ray luminosities of (log I/x(ergs s~^)> 42). We there- 
fore assume a sample of 84 (77% of 109) X-ray-selected 
AGN in the MlPS-selected sample. We now consider how 
many IR-selected AGN can be added to this total. 

8.1. Infrared power-law and color- selected AGN 

Of the 55 power-law AGN in our sample, 30 lack cat- 
aloged X-ray counterparts. If we assume that all of 
the X-ray weakly- and non-detected PLGs are AGN, 
the power-law selection criteria increases the number of 
known AGN (84) by 36%. Of the 25/30 X-ray non- 
cataloged sources for which we could test for faint emis- 
sion, 7 (28%) show weak X-ray emission at the > 2a 



confidence level, and 18 show no sign of X-ray emis- 
sion. Correcting for the 5 sources with nearby X-ray 
counterparts therefore gives an estimate of 8.4 weakly- 
detected sources. To place a conservative lower-limit 
on the contribution of X-ray non-detected PLGs to the 
AGN population, we select as AGN those PLGs that are 
either weakly-detected in the X-ray (8.4) or that have 
extremely red slopes of a < —1.0 (8, see §6). Combin- 
ing these two criteria results in a sample of 11.6 AGN 
candidates missed in the X-ray catalogs, for a contribu- 
tion of 14%. We therefore conclude that PLG selection 
increases the number of known MlPS-detcctcd AGN by 
~ 14 — 36%. Further adding the 3 color-selected galaxies 
that lie outside the 3cr star-forming contours and that 
lack cataloged X-ray counterparts (after correcting for 
the fraction with redshifts and for which the distance 
from the star-forming contours could therefore be deter- 
mined) increases the contribution of IRAC-selected AGN 
to ~ 18 - 40%. 

A search for weak X-ray emission from the full sam- 
ple of color-selected galaxies results in the detection of 
44 additional sources, 70% of which have X-ray lumi- 
nosities typical of AGN (sec Table 2). Of these 30 AGN 
candidates, however, 19 (63%) lie at z > 1.75, the red- 
shift above which nearly all (94%) MIPS sources meet 
the Lacy et al. criterion, regardless of their nature 
(AGN/star- forming). Their selection as AGN candidates 
is therefore not primarily a function of their IRAC colors, 
but of their X-ray properties. As such, we do not add 
these additional ~ 30 AGN candidates to our Spitzer- 
selected total. 

8.2. Radio /Infrared- selected AGN 

Radio/infrared selection, in which objects are selected 
for excess radio emission relative to that at 24 jim., pro- 
vides an alternative way to identify AGN independently 
of their optical and X-ray characteristics. In Donley et 
al. (2005), radio-excess AGN are defined as those sources 
with log /24 ^m//i.4GHz < 0. Unhke PLGs, radio-excess 
AGN tend to lie at z ~ 1, have Seyfert-like X-ray lumi- 
nosities of log ix(crgs s~i)~ 42-43, and have NIR SEDs 
dominated by the stellar bump (Donley et al. 2005). 
Of the PLGs detected in the CDF-N, only 3% meet the 
radio-excess criteria (Donley et al. 2007). While there 
is therefore almost no overlap between these two AGN 
populations, their X-ray detection statistics are very sim- 
ilar: only 40% of radio-excess AGN arc cataloged in the 
2 Ms CDF-N. If we consider only those radio-excess AGN 
with 24 /zm flux densities in excess of 80 /iJy, the X-ray 
detection fraction rises to ~ 60%. At this fiux limit, 
the CDF-N sample is complete to radio-excess AGN as 
defined by Donley et al. (2005). 

In the CDF-N, the X-ray and MlPS-detected radio- 
excess galaxies (with X-ray exposures greater than 1 Ms) 
account for 3% of the total number of such sources, and 
their X-ray non-detected counterparts increase the num- 
ber of known AGN by 2%. Only 10-15% of AGN, how- 
ever, are radio-intermediate or radio-loud. This small ob- 
served sample of radio-excess AGN is therefore indicative 
of an underlying population at least 7 times larger, which 
would increase the known population of MlPS-detected 
AGN by ~ 15% if a way could be found to identify them. 

Martinez-Sansigre et al. (2006) also select high- 
redshift obscured AGN candidates via a 24 /um and radio 
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flux cut, although of the 21 AGN candidates chosen by 
them, only 6 (29%) are sufficiently radio loud to meet 
the infrared-to-radio selection criteria used above to de- 
fine radio-excess AGN. Their selection, however, also in- 
cludes a 3.6 /um IRAC cut designed to select red galax- 
ies. Because it is designed for use in shallow surveys, 
only throe galaxies in the GOODS region of the CDF- 
N meet their MIPS and radio flux cuts (using the radio 
data of Richards 2000), and none are red enough to meet 
all three criteria. This selection method therefore does 
not contribute to the AGN sample in the deep CDF-S. 

8.3. IR-excess Galaxies 

As discussed above, the IR-excess samples of Daddi et 
al. (2007a), Dey ct al. (2008), PoUetta et al. (2008), and 
Fiore et al. (2008) contain various fractions of AGN and 
star-forming galaxies, with only the PoUctta et al. se- 
lection criteria unquestionably identifying AGN. Of the 
PoUetta et al. sources, however, 80% are detected in the 
X-ray catalogs, and the remaining source is a weakly- 
detected PLG, whose contribution to the AGN popula- 
tion has already been considered. 

Daddi ct al. (2007a, b) conclude that at least 50% 
of their IR-cxccss galaxies are Compton-thick AGN. If 
this hypothesis holds for the sources in our sample, the 
Daddi et al. selection criteria would contribute > 21 
AGN to the MlPS-selectcd sample in the ISAAC field, 
or > 31 AGN to the full MIPS sample (as the ISAAC 
region comprises only 68% of our full survey area). As 
discussed in §7.1, however, it appears plausible that the 
properties of many of these sources can be attributed 
to star-formation. We therefore add only the weakly- 
detected galaxies, whose mean X-ray luminosity suggests 
a likely AGN origin. Of the 13 weakly-detected Daddi et 
al. IR-excess galaxies, 1 is a weakly-detected PLG and 
1 is a color-selected galaxy that lies > 3a- from the star- 
forming contours, leaving 11 sources whose contribution 
is yet to be counted (or 12.6 when we correct for the 
sources for which we could not test for weak X-ray emis- 
sion). Further scaling to the full sample region results in 
an additional contribution of 18.6 sources, or 22%. 

F. Fiore (private communication, 2008) likewise con- 
cludes that ^ 60% of the X-ray non-cataloged sources in 
our sample are obscured AGN. If so, our MIPS sample in 
the ISAAC field should contain 25 X-ray weakly or non- 
detected Fiore-selected AGN. Of the 36/42 X-ray non- 
cataloged Fiore sources in our sample (which do not lie 
too close to a known X-ray source to test for faint emis- 
sion), however, only 10 (28%) are X-ray weakly-detected 
(with properties indicative of heavy obscuration). The 
remaining 26 sources show no evidence for X-ray emis- 
sion, and have properties that may also be consistent 
with star-formation. We therefore only consider the con- 
tribution from the 10 weakly-detected sources. Of these 
10 sources, 3 are weakly-detected power-law galaxies. 
Correcting for the ISAAC field of view (and the 6 sources 
for which we could not test for weak X-ray emission) re- 
sults in an additional contribution of 12.0 AGN, or 14%. 

8.3.1. Combined Contribution 

By combining the contribution of reliable power-law, 

color-selected, radio/infrared, and IR-excess AGN candi- 
dates, we therefore estimate that 6pzteer-selected sam- 
ples increase the known X-ray-selected AGN population 



by ^ 54 — 77%, down to a 24 /im flux density of 80 /iJy. 
In addition, the radio/infrared selection implies a ^ 17% 
contribution from radio-excess and radio-quiet AGN yet 
to be identified. The number of AGN with 24 fim flux 
densities > 80 /xJy is therefore 71 — 94% larger than that 
of current samples detected both in the X-ray and at 
24 yum in the deepest X-ray fields. 

9. AGN FRACTION OF THE MIR SAMPLE 

We plot in Figure 14a the fraction of the MIPS sam- 
ple comprised of MIR-dominated AGN as a function of 
24 fim flux density. We define as 'MIR-dominated' those 
AGN that (1) meet the PLG criteria, provided that they 
are also X-ray-weakly-detected or have extremely red 
slopes of a < —1.0 (these criteria were used above to 
place a conservative lower limit on the contribution from 
PLCs), (2) meet the IRAC color-color cuts of Lacy et 
al. (2004) or Stern et al. (2005) and ho outside the 
3(T star-forming contours, (3) meet the PoUetta et al. 
(2008) criteria, (4) meet the Fiore et al. (2008) criteria 
and are X-ray cataloged or weakly-detected, or (5) meet 
the Daddi et al. (2007) criteria and are X-ray cataloged 
or weakly-detected. As above, we define as X-ray AGN 
those sources with cataloged X-ray counterparts whoso 
total X-ray luminosities (when redshifts are available) 
exceed 10^^ ergs s~^. Because the PLG, Daddi et al. 
(2007), and Fiore et al. (2008) selection criteria all iden- 
tify AGN candidates with mean redshifts of z ~ 2, the 
vast majority of the IR-selected AGN lie at high redshift 
regardless of their flux density. 

As shown by Brand et al. (2006) , the fraction of MIR 
soiirces dominated by an AGN drops with decreasing flux 
density. We verify this trend, showing excellent agree- 
ment with the Brand et al. results (see Figure 13a), and 
confirm that it continues down to ~ 300 /xJy. At lower 
flux densities, however, the fraction of MIR-dominated 
X-ray AGN begins to decrease at a much lower rate, and 
that of all MIR-dominated AGN plateaus at a value of 
~ 10%. This indicates that a non- negligible fraction of 
faint MIR sources are not powered primarily by star- 
formation, but by their central engines. The significant 
number of faint 24 /xm sources that arc AGN-dominated 
in the MIR is consistent with the finding of 6/19 such 
objects in the spectroscopy of faint 24 /zm sources by 
Rigby et al. (2008). At these low flux densities, it is 
also evident from Figure 14a that the majority of the 
MIR-dominated AGN are not detected in the X-ray. For 
instance, at 80 /iJy, the total number of MIR-dominated 
AGN outnumbers that of MIR-dominated X-ray AGN 
by a factor of ~ 6. Using only the X-ray emission as a 
probe of AGN activity will therefore result in a serious 
underestimation of the AGN contribution to the MIR 
flux density. 

Finally, we plot in Figure 14b the fraction of all AGN 
in the MIPS sample (not only those whose MIR flux is 
dominated by the AGN), as a function of MIPS 24 /xm 

flux density. To account for the contribution of the AGN 
amongst the 14 X-ray sources lacking redshifts, we ran- 
domly add 11, or 77% (the AGN fraction of the X-ray 
sample), of these 14 X-ray sources to the X-ray AGN 
sample. We further supplement the AGN sample by in- 
cluding the assumed contribution from radio-excess and 
radio-quiet AGN, 17%, or 14 sources. Because these 
sources have not been individually identified in the CDF- 
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Fig. 14. — Fraction of MIR-dominatcd AGN, and all AGN (regardless of their contribution to the MIR light), as a function of 24 fim 
flux density. The full definitions of 'MIR-dominated' and 'X-ray AGN' are given in §9. Error bars represent the la errors on the source 
number counts, and the width of the bins. The MIR,-s(1(k lc>d sources missed in the X-ray comprise the majority of the MIR-dominated 
AGN at low flux densities, where the trend towards lower AGN fractions plateaus. 

S, we draw their flux densities randomly from the l aiigo 
of observed flux densities in the CDF-N (80 - 300 /iJy). 

The X-ray non-detected AGN still comprise the ma- 
jority of all AGN at the lowest 24 fim flux densities, and 
their contribution at larger flux densities is significant, 
raising the AGN fraction to > 15% at all 24 /xm flux 
densities. As was seen for the MIR-dominatcd AGN, the 
total fraction of AGN rises with increasing MIR flux den- 
sity, reaching a value of 37% at /24 ^m ^ 800 /iJy, 15% 
higher than that of MIR-dominated AGN of the same 
flux density. The X-ray AGN fractions we find are some- 
what higher than that of Trcistcr et al. (2006). While 
the cause of this offset is not entirely clear, it is likely 
to be due at least in part to cosmic variance, as their 
24 fim sample was drawn from GOODS-N, and ours from 
GOODS-S. Once again, the vast majority of X-ray non- 
detected AGN at all flux densities lie at z ~ 2, the mean 
rcdshift of the sources selected via the PLG, Daddi et 
al. (2007), and Fiore et al. (2008) criteria. Only the 
radio-selected AGN samples lie at systematically lower 
redshifts of 2; ~ 1. 



10. SUMMARY 

Infrared selection of AGN is a powerful technique. Us- 
ing new accurate star-forming and AGN templates along 
with a flux-limited MlPS-sclcctcd sample drawn from the 
GOODS-S field, we critically review three MIR selection 
criteria: (1) the IRAC color cuts of Lacy et al. (2004) 
and Stern et al. (2005), (2) the power-law galaxy (PLG) 
selection technique of Alonso-Herrero et al. (2006) and 
Donley et al. (2007), and (3) the IR-cxccss selection cri- 
teria of Daddi et al. (2007a,b), Dey et al. (2008), Fiore 
etaL (2008), and PoUcttaet al. (2008). From this analy- 
sis, we then quantify the contribution of Spitzer-selected 
AGN to the X-ray selected AGN population. The main 
conclusions of this paper are as follows: 

• The majority of non-power-law IRAC color- 
selected AGN candidates have IR colors consis- 
tent with those of redshift-appropriate star-forming 



templates. In comparison, the majority of PLG 
AGN candidates lie outside of the star-forming con- 
tours. PLG selection recovers the majority of high- 
quality AGN candidates. 

• The reliability of AGN IRAC color-color selection 
improves with increasing fiux as high-redshift star- 
forming galaxies fall out of the sample. Neverthe- 
less, the fraction of potential star-forming contam- 
inants is still high 50%) at the highest fluxes 
probed by our survey (/24 fj,m ~ 500 /uJy). 

• A comparison of the 24 /zm to 3.6 /xm colors of 

the X-ray non-detected PLGs to those of AGN 
and star-forming templates suggests that the X- 
ray non-dctcctcd PLGs, like their X-ray detected 
counterparts, have more hot dust emission than can 
be explained by star-formation alone. 

• An analysis of the Daddi et al. (2007) IR- 

excess sotirces in our MIPS sample indicates that 
while these sources may be Compton-thick AGN, 
it is also possible that they are low-luminosity, 
Compton-thin AGN and/or luminous, highly- 
reddened star- forming galaxies. 

• An X-ray stacking analysis of the sources selected 

via the Fiore et al. (2008) criteria indicate that 
~ 42% of the sources are consistent with being ob- 
scured AGN, and that the remaining 58% may be 
star-forming galaxies. 

• Adding secure Spitzer-selected power-law, color- 
color. radio/IR, and IR-excess AGN candidates to 
the (lec;pest X-ray samples directly increases the 
number of known AGN by ~ 54 — 77%, and im- 
plies a total increase of 71 — 94%. This fraction 
excludes the full contributions from the Daddi et 
al. and Fiore et al. AGN candidates, whose nature 
is still uncertain. 
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• The fraction of MIR sources dominated by an AGN 
decreases with decreasing flux density, but only 
down to a 24 /um flux density of ~ 300 /xJy. Below 
this limit, the AGN fraction levels out at ~ 10%. 
This indicates that a non-negligible fraction of faint 
24 /um sources are primarily powered not by star- 
formation, but by the central engine. In addition, 
the majority of AGN at low 24 ^m flux densities are 
missed in the X-ray, indicating that X-ray emission 
alone cannot be used to identiiy AGN, especially 
amongst faint IR samples. 
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TABLE 1 
Photometric Redshifts 



Sample 


Ngrcs 




Nspcc 


Az 


CTz 


% with 


% with 














Az > 0.10 


Az > 0.20 


All 


713 


649 


249 


0.012 


0.15 


11.3% 


4.6% 


IR-Normal 


448 


424 


187 


0.012 


0.12 


7.2% 


2.8% 


Color-Selected 


210 


196 


51 


0.015 


0.10 


20.4% 


4.1% 


Power-law 


55 


29 


11 


-0.013 


0.48 


40.0% 


40.0% 



TABLE 2 
X-RAY Detection Statistics 





MIPS'' 


MIPS/ISAAC 


Daddi ct al. 


Dcy ct al. 


Fiorc ct al. 


Pollctta ct 


1-24 fini/ fa. 


All 




Sample 


Sample 


(2007) 


(2008) 


(2008) 


al. (2008) 


> 1000 


IR-Excess 


Total 


713 


465 


42 


10 


52 


5 


71 


101 


X-rays 


109 


76 


3 


3 


10 


4 


11 


15 


Power 


25 (100%) 


14 





3 


4 


4 


4 


7 


Color 


35 (79%) 


28 


3 





5 





6 


7 


Noriiiai 


49 ((>."")%) 


31 








1 





1 


1 


Weak X-rays 


157 


125 


13 


2 


10 


1 


16 


26 


Power 


7 (100%) 


5 


1 


1 


3 


1 


4 


4 


Color 


44 (70%) 


35 


11 


1 


7 





11 


20 


Normal 


106 (19%) 


85 


1 











1 


2 


No X-rays 


361 


226 


21 


3 


26 





38 


51 


Power 


18 


12 


3 





7 





9 


11 


Color 


109 


72 


15 


3 


19 





28 


36 


Normal 


234 


142 


3 











1 


4 



Note. — The sum of the weakly and non-detected sources does not equal the full number of X-ray non-cataloged sources. 
The difference represents the number of sources that lie too close to a known X-ray counterpart to test for faint X-ray 
emission. 

^ The fraction of X-ray sources with AGN X-ray luminosities of log Lx(ergs s~^)> 42 is given in parentheses. 



TABLE 3 

Fraction of sources that lie outside the 
star-forming contours 



Selection 


la 


2a 


3a 


Lacy 


52% 


21% 


10% 


Stern 


10%. 


18% 


8% 


Powci-iaw 




79 %.()!%, 


71%,, 13% 


h3x:y 


29% 


12% 


5% 


Stern 


22% 


5% 


0% 


Power-law 


71%,68% 


61%,46% 


57%,39% 



Note. — The upper portion of the table as- 
sumes no errors on the photometric redshifts. 
The lower portion assumes 10% errors. The two 
values given for the PLCs represent the frac- 
tion of sources that lie outside the star-forming 
contours in Lacy and Stern color-space, respec- 
tively. 

APPENDIX 

APPENDIX A: PHOTOMETRIC REDSHIFT TECHNIQUES 

We used two methods to determine photometric redshifts. The first utihzes the extensive high-resolution template set of 
Perez-Gonzalez et al. (2008), which was created by fitting stellar population synthesis and dust emission models to the 
^1500 galaxies in the CDF-N and CDF-S with secure spectroscopic redshifts. When applied to all spcctroscopically- 
dctccted IRAC-sclcctcd galaxies in the GOODS-N and GOODS-S, this template library returns photometric redshifts 
with A(z)<0.1 for 88% of the sources, and A(z)<0.2 for 96%, where A(z) = (zp - Zs)/{1 + Zs) ( Perez-Gonzalez et al. 
2008). Because this method relies on star- forming templates, however, we do not expect it to provide equally reliable 
photometric redshifts for galaxies in which the stellar features are dominated by emission from an AGN. 

Our second method is based on the chi-squared minimization routine HYPERZ (Bolzonella, Miralles, & Pello 2000). 
With a suite of normal star-forming, LIRG/ULIRG, and AGN templates, we can better account for the range of sources 
expected in our 24 /xm selected sample, albeit with a smaller template library. To create this library, we started with 
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TABLE 4 

X-RAY DETECTION FRACTION (AND TOTAL NUMBER) OF 

SfM'RCES THAT LIE OI'THIDL THE HTAR-FORMINC COXTOURS 



Selection 


1 


a 


•2a 




3a 




LsLcy 


21% 


(19) 


26% 


(10) 


42% 


(8) 


Power-law (Lacy) 


70% 


(19) 


83% 


(19) 


90% 


(19) 


Stern 


19% 


(5) 


25% 


(3) 


40% 


(2) 


Power-law (Stern) 


67% 


(16) 


72% 


(13) 


85% 


(11) 


Lacy 


29% 


(15) 


36% 


(8) 


67% 


(6) 


Power-law (Lacy) 


81% 


(17) 


89% 


(16) 


88% 


(15) 


Stern 


14% 


(2) 


0% 


(0) 


0% 


(0) 


Power-law (Stern) 


80% 


(16) 


86% 


(12) 


82% 


(9) 



Note. — The upper portion of the table assumes no errors 
on the photometric redshifts. The lower portion assumes 10% 
errors. 



TABLE 5 
Percent of secure color-selected 

GALAXIES vs. 24 /iM FLUX 



24 fim flux cut 


Lacy 


Stern 


Power-law 


(MJy) 








80 


52% 


40% 


93%, 82% 


100 


54% 


44% 


96%, 84% 


150 


53% 


43% 


94%, 78% 


200 


60% 


63% 


100%, 76% 


300 


53% 


25% 


100%, 72% 


400 


52% 


50% 


100%, 66% 


500 


56% 


50% 


100%, 71% 


80 


29% 


21% 


72%, 68% 


100 


29% 


24% 


76%, 72% 


150 


25% 


21% 


73%, 63% 


200 


32% 


27% 


92%, 76% 


300 


31% 


25% 


90%, 72% 


400 


39% 


50% 


88%, 66% 


500 


50% 


50% 


85%, 71% 



Note. — The upper portion of the table as- 
sumes no errors on the photometric redshifts. 
The lower portion assumes 10% errors. The two 
values given for the PLGs represent the frac- 
tion of sources that lie outside the star-forming 
contours in Lacy and Stern color-space, respec- 
tively. 

a sample of 10 star-forming templates and 8 AGN templates from Silva et al. (1998) and PoUetta et al. (2007). 
The PoUetta et al. AGN templates cover a range of intrinsic obscurations (Type 1 and Type 2) and luminosities 
(Sey/QSO). We then supplemented this sample with 10 empirical star-forming LIRG and ULIRG templates. The full 
template sample is listed in Table 1. The LIRG and ULIRG templates, described in detail in Riekc et al. (2008, in 
prep.), significantly improve upon previous semi-empirical templates by constraining the SEDs between 1 and 6 fim 
with 2MASS and IRAC photometry and by basing the SEDs from 5 to 35 /im on IRS spectra. 

Using this template library, we ran HYPERZ on all robust optical-MIR data for the IR-normal and color-selected 
galaxies. For the PLGs, we removed all photometry longward of 3.6 /zm (IRAC channel 1). Including the full Spitzer 
photometry for PLGs provided little or no additional constraint on potential redshifts, and limited severely the SEDs 
for which a good fit could be found. We allowed z to vary from to 4 and to vary from to 1.2. While its 
shape is still relatively unconstrained, a number of studies have suggested that the extinction curve of AGN most 
closely resembles that of the SMC (e.g. Richards et al. 2003). We therefore assumed by default the SMC extinction 
curve of Bouchet et al. (1985) for the PLGs and color-selected galaxies. For the IR-normal galaxies, we assume the 
Calzetti (2000) extinction curve, as this curve was modeled to represent the extinction properties of starburst galaxies. 
To prevent unrealistic redshift solutions, we applied the redshift- and model-dependent absolute magnitude cuts of 
PoUetta et al. (2007). Finally, to increase the weight of the IRAC photometry, which solely defines the red slope 
of the stellar bump, we set the errors on the IRAC photometry to the measurement errors, as opposed to the total 
photometric errors. This procedure is acceptable because many types of photometric error will have similar effects on 
the four IRAC bands, so we are making use of the overall internal consistency expected for the data. 

To improve the likelihood of obtaining a good fit for each source, we then varied a number of these assumptions, 
and examined by eye the resulting fits to choose the most convincing redshift solution. This visual inspection is an 
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TABLE 1 

Al: Photometric Redshift Templates 



Template 


Type 


Ref 


E112 


2 Gyr old elliptical 


(2) 


E115 


5 Gyr old elliptical 


(2) 


E1113 


13 Gyr old elliptical 


(2^ 


SO 


Spiral 


\^ 1 


Sa 


Spiral a 


(I) 


Sb 


Spiral b 


(I) 


Sc 


Spiral c 




Sd 


Spiral d 




Sdm 


Spiral din 




M82 


Starburst 




NGC 6090 


LIRG /Starburst 


fl) 


ESO320-G030 


LIRG /Starburst 


(3) 


NGC 1614 


LIRG/Starburst 


(3) 


NGC 2639 


LIRG/Starburst 


(3) 


NGC 3256 


LIRG/Starburst 


(3) 


NGC 4194 


LIRG/Starburst 


(3) 


Arp 220 


ULIRG /Starburst 


(1) 


Arp 220 


ULIRG /Starburst 


(3) 


IRAS 12112+0305 


ULIRG/Starburst 


(3) 


IRAS 14348-1447 


ULIRG/Starburst 


(3) 


IRAS 17208-0014 


ULIRG/Starburst 


(3) 


IRAS 22491-1808 


ULIRG/Starburst 


(3) 


IRAS 22491-1808 


ULIRG/Starburst 


(1) 


Mrk 231 


ULIRG/ Seyfert 1 


(1) 


Seyl.8 


Scyfcrt 1.8 


(1) 


TQSOl 


Type 1 QSO 


(1) 


Scy2 


Scyfcrt 2 


(1) 


NGC 6240 


Star burst /Seyfert 2 


(1) 


IRAS 19254-7245 


ULIRG/Seyfert 2 


(1) 


IRAS 20551-4250 


ULIRG/Buried AGN 


(1) 


QS02 


Type 2 QSO 


(1) 



References. — (1) PoUetta et al. (2007), (2) 
Silva et al. (1998), (3) Rieke et al. (2008) 



important characteristic of our work, and was made possible by the relatively small number of sources in our sample. 

First, each PLG and color-selected galaxy was fit by both the SMC and Calzetti extinction laws. In most cases, the 
resulting redshift fits and solutions varied only slightly (in which case wc chose the SMC-dcrivcd fit), but for some 
galaxies, one of the; two extinction laws provided a clearly superior fit as determined by a visual inspcc;tion. Second, 
if the absolute magnitude of the resulting best-fit template did not meet the redshift-dependent absolute magnitude 
cut of PoUetta et al. (2007), we allowed M-q to vary between -23.7 to -17 for star-forming templates and -28.8 to -19 
for AGN templates (e.g. PoUetta et al. 2007). Third, we allowed the IRAC errors to increase to their total estimated 
values by adding a 10% error to the flux; in only 4 cases did this lead to a better fit. Finally, for the PLCs, we explored 
fits that did not include the 3.6 /im IRAC channel; only 2 sources benefited from this change. 

The final step in our redshift estimation was an independent review of the redshifts by two authors. By examining 
by eye the resulting redshift fits, we chose for each PLG and color-selected galaxy the best HYPERZ redshift. We then 
compared this to the redshift fit from the Perez-Gonzalez et al. (2008) technique, and replaced the former with the 
latter if it clearly provided a better fit. For the IR-normal galaxies, we use the Perez-Gonzalez et al. (2008) redshifts by 
default, as these are optimized for normal galaxies. We do, however, remove unconvincing redshift fits, and substitute a 
solid HYPERZ redshift if available. For all sources, we only assign a photometric redshift if a convincing fit exists. We 
rejected 76 sources with poor data or other problems that compromise our redshift determination. The photometric 
redshifts classified as being of high quality are summarized in Table 1 and shown in Figure 1. 

APPENDIX B: REDSHIFT-DEPENDENT COLOR SELECTION 
z = - 0.25 

The lowest redshift bin contains no PLCs and 2 color-selected galaxies, both of which are detected in the X-ray, 
but with low observed X-ray luminosities of log ix(ergs s~^)= 41.3 and log L^{evgs s~^)= 40.9, indicative of powerful 
starbursts or very low-luminosity AGN. 

z = 0.25 - 0.75 

The second redshift bin contains 3 PLGs, 27 Lacy-selected galaxies, and 4 Stern-selected galaxies. Of the Lacy and 
Stern-selected sources, only 41% and 25% are detected in the X-ray, respectively, compared to 100% of the PLGs. 
The average observed 0.5-8 keV X-ray luminosity of the X-ray-detected color-selected galaxies, log Lx(ergs s~-'^)= 
43.0, is consistent with AGN activity, but is an order of magnitude less than that of the PLGs in this redshift bin, 
log Lx(ergs s~^)= 44.1. This is not surprising, as power-law selection preferentially identifies the most luminous AGN 
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wlioro the emission from the central engine is able to overpower that of the host galaxy (Donley et al. 2007). The 
AGN that lie amongst the color-selected sample clearly represent a less luminous AGN population than those selected 
via the PLG criteria. Furthermore, of the 27 Lacy-selected (and 4 Stern-selected) galaxies, only 11 (2) lie outside the 
Icr star-forming contours in their respective color-space, with the number dropping to 3 and 1 (2 and 0) at 2cr and 
3a. This suggests that the majority of the color-selected sources in this redshift bin have infrared colors indicative of 
IR-normal galaxies. While all of the PLGs lie outside even the 3a contours in the Lacy color-space, they lie inside the 
contours in Stern-space, due primarily to the Sdm template of PoUetta et al. (2007), whose strong 3.3 fim aromatic 
feature passes through the 4.5 /im IRAC band at z ~ 0.4, causing the template to enter the Stern AGN selection 
region at z = 0.23 — 0.51. Of the 2 PLGs whose colors overlap with this template, both have spectroscopic redshifts 
of z > 0.67, and would therefore be 'safe' if we considered smaller redshift bins. Removing this template, however, 
would have no effect on the number of Lacy or Stern-selected sources that lie outside the la contours. 

z = 0.75 - 1.25 

We find a higher proportion of secure color-selected galaxies in the z = 0.75—1.25 bin, with 10/12 of the Lacy-selected 
sources and 6/13 of the Stern-selected sources lying outside the la star-forming contours. This is not surprising, as 
it is at z ~ 1 that the star-forming contours are best separated from the AGN selection region. If we extend our test 
to 2a, however, the numbers drop significantly, to 5/12 and 1/13. The majority of the Stern-selected galaxies in this 
redshift bin fill the lower-left corner of the AGN selection region, the region populated by the Stern-only sources whose 
Lacy colors place them in the star-forming locus of color-space. These star-forming galaxies are likely responsible for 
the fact that this is the only redshift bin in which the Stern-selected sources outnumber the Lacy-selected sources. 
Of the 5 PLGs in this redshift bin, all lie outside the la star-forming contours in both Lacy and Stern color-space, 
and while only 3 lie outside of the 2a contours in Stern color-space, all lie outside of the 2a and 3a colors in Lacy 
color-space. 

2 = 1.25 - 1.75 

The number of color-selected galaxies rises significantly in the z — 1.25 — 1.75 redshift bin. The fraction of potential 
IR-selected AGN, however, is by far the lowest at these redshifts, with only 6/38 Lacy-selected sources and 3/31 
Stern-selected sources falling outside the la star-forming contours. In addition, only 3/38 Lacy sources and 4/31 Stern 
sources have X-ray counterparts, further suggesting that nearly all of the color-selected sources at this redshift are 
star-forming galaxies, and not AGN. In contrast, the 2 PLGs found in this redshift bin fall outside of the la contours 
in both the Lacy and Stern plots, and both are detected in the X-ray. 

z = 1.75 - 2.25 

The largest number of Lacy-selected sources, 77, is found in the z = 1.75 — 2.25 bin. In addition, an extraordinarily 
high fraction, 50/77, lie outside of the la star-forming contours, a surprising fact given the X-ray detection fraction 
of only 9%. Have we discovered a significant population of z ~ 2 obscured AGN similar to those claimed by Daddi et 
al. (2007), or is there another explanation for this population? 

The AGN contours (shown in green) provide a better match to the IR colors of these color-selected galaxies than 
do the star- forming templates (shown in blue). At this redshift, the primary difference between the IRAC regions 
of the star-forming and the low-luminosity AGN templates is the strength of the CO index, which is stronger in the 
star-forming galaxies than in the AGN. The MlPS-selected sources therefore may have smaller CO indices than those 
of our local templates. One explanation for this offset is that an underlying AGN continuum has diluted this feature. 
A lower CO index, however, could also be attributed to evolution in the metallicity of the LIRGS/ULIRGS. At 1/3 
solar metallicity, the CO index drops by 4-6 percentage points (McGregor 1987), causing the contours to shift upwards 
in Lacy color space by ^ 0.08. While this lessens this offset between the colors of star-forming contours and the 
observed galaxies, it cannot fully account for the observed discrepancy. Thus, if a change in the CO index is invoked 
for the offset, it is likely that AGN continua are also present. 

However, other possible explanations exist. If we incorporate into the contours a 10% error in (H-z) (recall that the 
measured a for our photometric redshift fits was 0.15 and that 11% of the sources in our spectroscopic redshift sample 
have photometric errors > 10%) the resulting contours are far-better matched to the IR-colors of the color-selected 
galaxies. Of the Lacy- and Stern-selected galaxies, only 17% and 9% now lie outside of the la contours, with the 
numbers dropping to 8% and 0% at 2a and 4% and 0% at 3a, respectively. 

The templates with which the photometric redshifts were best fit can provide further insight into the sources in this 
redshift bin. Of the 90 non-PLGs in the z 2 bin, 75 (86%) are best fit by a star-forming ULIRG template, 9 (10%) 
are best fit by a ULIRG/hidden AGN template, 1 (1%) is best-fit by a type 2 AGN template, and 1 (1%) is best fit 
by a type 1 AGN template. In contrast, all of the PLGs in this redshift range are best-fit by a type 2 AGN template. 
Therefore, while we can not rule out the possibility that we have detected a sample of high-redshift, heavily obscured 
AGN, the extremely low X-ray detection fraction of 9%, the much improved fit of the contours for which 10% errors 
in the photometric redshifts were included, and the overwhelming fraction of sources for which a purely star-forming 
ULIRG provided the best fit to the SED suggest that it is more likely that the vast majority of color-selected galaxies 
in this redshift bin are star-forming galaxies, not AGN. 
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z > 2.25 

The number of color-selected galaxies in the remaining two redshift bins is relatively low: 18 Lacy-selected sources 

and 5 Stern selected sources. Of these, a large fraction (61% and 80%, respectively) lie outside of the star-forming 
contours, which cover a comparatively small portion of the color-space. At 2a and 3(t, the fractions drop to 39% 
and 11% (Lacy) and 20% and 0% (Stern). At these rcdshifts, the X-ray detection fractions of both the color-selected 
sources and the PLGs are low: 17% for the Lacy sources, 40% for the Stern sources, and 23% for the PLCs. At z = 2.5, 
an unobscured AGN with F = 2 requires a rest-frame 0.5-8 keV luminosity of log Lx(ergs s~^)= 42.8 to meet the flux 
limit within 1' of the CDF-S aimpoint. At z = 3, the required value rises to log L^{eigs s~''^)= 43.0, suggesting that if 
these sources are AGN, they must have low luminosities, or high obscuring columns. 

z = unknown 

Not all sources in our sample have redshift estimates. The redshift completeness for the Lacy and Stern color- 
selected samples is high (93% and 90%), respectively), but wc have high-quality redshifts for only 49% of the PLGs. 
The difficulty in fitting redshifts to these sources stems largely from their faint fluxes: the power-law (color-selected) 
galaxies without redshifts are significantly fainter, V = 26.2 (25.3), than those with redshift estimates, V = 25.4 
(V = 24.9), suggesting that these sources may preferentially lie at high redshift. 

The last panels of Figures 5 and 6 show the IRAC colors of sources without redshift estimates. Overplotted are the 
star-forming contours for z = — 4. Of the sources without redshifts, all but 1 lie inside the Lacy selection region 
and all but 9 lie inside the Stern selection region. This is not surprising, as sources with non-stellar continua are the 
hardest sources to fit. Of the 13 Lacy-selected sources, 5 lie outside the la star-forming contours, and 2 have X-ray 
counterparts. Of the 26 PLGs, 11 lie outside the la star-forming contours, and 6 are X-ray-detected. There is a 
noticeable concentration of X-ray non-detected PLGs towards the red end of the power-law locus. These sources are 
discussed in more detail in §6. 



